Mutated gene from corynebacterium glutamicum

ABSTRACT

The invention relates to a novel nucleic acid molecule, to the use thereof for constructing genetically improved microorganisms and to methods for preparing fine chemicals, in particular amino acids, with the aid of said genetically improved microorganisms.

BACKGROUND OF THE INVENTION

Particular products and byproducts of naturally occurring metabolic processes in cells are used in many branches of industry, including the food industry, the animal feed industry, the cosmetic industry and the pharmaceutical industry. These molecules which are collectively referred to as “fine chemicals” comprise organic acids, both proteinogenic and nonproteinogenic amino acids, nucleotides and nucleosides, lipids and fatty acids, diols, carbohydrates, aromatic compounds, vitamins and cofactors and also enzymes. They are best produced by means of cultivating, on a large scale, bacteria which have been developed to produce and secrete large amounts of the molecule desired in each particular case. An organism which is particularly suitable for this purpose is Corynebacterium glutamicum, a Gram-positive nonpathogenic bacterium. Using strain selection, a number of mutant strains have been developed which produce various desirable compounds. The selection of strains which have been improved with respect to the production of a particular molecule is, however, a time-consuming and difficult process.

BRIEF DESCRIPTION OF THE INVENTION

The present invention provides novel nucleic acid molecules which can be used for identifying or classifying Corynebacterium glutamicum or related bacteria species. C. glutamicum is a Gram-positive aerobic bacterium which is widely used in industry for the large-scale production of a number of fine chemicals and also for the degradation of hydrocarbons (e.g. in the case of crude oil spills) and for the oxidation of terpenoides. The nucleic acid molecules may therefore be used for identifying microorganisms which can be used for producing fine chemicals, for example by fermentation processes. Although C. glutamicum itself is nonpathogenic, it is, however, related to other Corynebacterium species, such as Corynebacterium diphtheriae (the diphtheria pathogen), which are major pathogens in humans. The ability to identify the presence of Corynebacterium species may therefore also be of significant clinical importance, for example in diagnostic applications. Moreover, said nucleic acid molecules may serve as reference points for mapping the C. glutamicum genome or the genomes of the related organisms.

These novel nucleic acid molecules encode proteins which are referred to herein as Marker- and fine chemical-producing (MCP) proteins. These MCP proteins may, for example, be involved directly or indirectly in the production of one or more fine chemicals in C. glutamicum. The MCP proteins of the invention may also be involved in the degradation of hydrocarbons or in the oxidation of terpenoides. Said proteins may be used for identifying Corynebacterium glutamicum or organisms related to C. glutamicum; the presence of an MCP protein specific for C. glutamicum and related species in a mixture of proteins may indicate the presence of one of said bacteria in the sample. Furthermore, these MCP-protein may have homolog in plants or animals, which are involved in a diseased state or an illness; thus, these proteins may serve as useful pharmaceutical targets for drug screening and the development of therapeutic compounds.

Owing to the availability of cloning vectors which can be used in Corynebacterium glutamicum, as disclosed, for example, in Sinskey et al., U.S. Pat. No. 4,649,119, and of techniques for the genetic manipulation of C. glutamicum and the related Brevibacterium species (e.g. lactofermentum) (Yoshihama et al., J. Bacteriol. 162:591-597 (1985); Katsumata et al., J. Bacteriol. 159:306-311 (1984); and Santamaria et al. J. Gen. Microbiol. 130:2237-2246 (1984)), the nucleic acid molecules of the invention can be used for genetic manipulation of said organism in order to modulate the production of one or more fine chemicals. This modulation may take place due to a direct effect of the manipulation of a gene of the invention or due to an indirect effect of such a manipulation. For example, it is possible, by modifying the activity of a protein which is involved in the biosynthesis or degradation of a fine chemical (i.e. by mutagenesis of the corresponding gene), to directly modulate the ability of the cell for synthesizing or degrading said compound, thereby modulating the yield and/or the efficiency of production of the fine chemical. Likewise it is possible, by modulating the activity of a protein which regulates a fine-chemical metabolic pathway, to influence directly, whether the production of the desired compound is up or down regulated, both of which modulate the yield or efficiency of production of the fine chemical from the cell.

The indirect modulation of the production of fine chemicals may also be carried out by modifying the activity of a protein of the invention (i.e. by mutagenesis of the corresponding gene) so that the ability of the cell to grow and to divide or to remain viable or productive is increased overall. The production of fine chemicals from C. glutamicum is usually achieved by large-scale fermentative culturing of said microorganisms conditions which are frequently suboptimal for growth and cell division. Modifying a protein of the invention (e.g. a stress reaction protein, a cell wall protein or a protein involved in the metabolism of compounds which are necessary for cell division and cell growth to take place, such as nucleotides and amino acids) such that survival, growth and propagation under said conditions can be improved, can make it possible to increase the number and the productivity of said modifed C. glutamicum cells in large-scale cultures, and this in turn should increase the yields and/or the efficiency of production of one or more of the desired fine chemicals. Furthermore, the metabolic pathways of a cell are necessarily dependent on one another and coregulated. Changing the activity of any metabolic pathway in C. glutamicum (i.e. changing the activity of one of the proteins of the invention, which is involved in such a pathway) makes it possible to change simultaneously the activity or regulation of another metabolic pathway in this microorganism, which may be involved directly in the synthesis or degradation of a fine chemical.

The present invention provides novel nucleic acid molecules encoding proteins which are referred to here as MCP proteins and which are, for example, capable of modulating the production or the efficiency of production of one or more fine chemicals in C. glutamicum or of serving as identification markers for C. glutamicum or related organisms. Nucleic acid molecules encoding an MCP protein are referred to here as MCP nucleic acid molecules. In a preferred embodiment, the MCP protein is capable of modulating the production or the efficiency of production of one or more fine chemicals in C. glutamicum or of serving as identification markers for C. glutamicum or related organisms. Examples of such proteins are those encoded by the genes listed in Table 1.

Consequently, one aspect of the invention relates to isolated nucleic acid molecules (e.g. cDNAs) comprising a nucleotide sequence which encodes an MCP protein or biologically active sections thereof and also nucleic acid fragments which are suitable as primers or hybridization probes for detecting or amplifying MCP-encoding nucleic acid (e.g. DNA or mRNA). In particularly preferred embodiments, the isolated nucleic acid molecule comprises any of the nucleotide sequences listed in Appendix A or the coding region of any of these nucleic acid sequences or a complement thereof. In other preferred embodiments, the isolated nucleic acid molecule encodes one of the amino acid sequences listed in Appendix B. The preferred MCP proteins of the invention likewise have preferably at least one of the MCP activities described herein.

Appendix A defines hereinbelow the nucleic acid sequences of the sequence listing together with the sequence modifications at the relevant position, described in Table 1.

Appendix B defines hereinbelow the polypeptide sequences of the sequence listing together with the sequence modifications at the relevant position, described in Table 1.

In a further embodiment, the isolated nucleic acid molecule is at least 15 nucleotides in length and hybridizes under stringent conditions to a nucleic acid molecule which comprises a nucleotide sequence of Appendix A. The isolated nucleic acid molecule preferably corresponds to a naturally occurring nucleic acid molecule. The isolated nucleic acid more preferably encodes a naturally occurring C. glutamicum MCP protein or a biologically active section thereof.

Another aspect of the invention relates to vectors, for example recombinant expression vectors, which contain the nucleic acid molecules of the invention and to host cells into which said vectors have been introduced. In one embodiment, an MCP protein is prepared by using a host cell which is cultivated in a suitable medium. The MCP protein may then be isolated from the medium or the host cell.

Another aspect of the invention relates to a genetically modified microorganism into which an MCP gene has been introduced or in which an MCP gene has been modified. In one embodiment, the genome of said microorganism has been modified by introducing at least one inventive nucleic acid molecule which encodes the mutated MCP sequence as transgene. In another embodiment, an endogenous MCP gene in the genome of said microorganism has been modified, for example functionally disrupted, by homologous recombination with a modified MCP gene. In a preferred embodiment, the microorganism belongs to the genus Corynebacterium or Brevibacterium, with Corynebacterium glutamicum being particularly preferred. In a preferred embodiment, the microorganism is also used for preparing a compound of interest, such as an amino acid, particularly preferably lysine.

Another preferred embodiment are host cells having more than one of the nucleic acid molecules described in Appendix A. Such host cells can be prepared in various ways known to the skilled worker. They may be transfected, for example, by vectors carrying several of the nucleic acid molecules of the invention. However, it is also possible to use a vector for introducing in each case one nucleic acid molecule of the invention into the host cell and therefore to use a plurality of vectors either simultaneously or sequentially. Thus it is possible to construct host cells which carry numerous, up to several hundred, nucleic acid sequences of the invention. Such an accumulation can often produce superadditive effects on the host cell with respect to fine-chemical productivity.

Another aspect of the invention relates to an isolated MCP protein or to a section, for example a biologically active section, thereof. In a preferred embodiment, the isolated MCP protein or its section is capable of modulating the production or the efficiency of production of one or more fine chemicals in C. glutamicum or of serving as identification markers for C. glutamicum or related organisms. In another preferred embodiment, the isolated MCP protein or a section thereof is sufficiently homologous to an amino acid sequence of Appendix B for the protein or its section to still be capable of, for example, modulating the production or the efficiency of production of one or more fine chemicals in C. glutamicum or of serving as identification markers for C. glutamicum or related organisms.

Moreover, the invention relates to an isolated MCP-protein preparation. In preferred embodiments, the MCP protein comprises an amino acid sequence of Appendix B. In a further preferred embodiment, the invention relates to an isolated full-length protein which is essentially homologous to a complete amino acid sequence of Appendix B (which is encoded by an open reading frame in Appendix A).

The MCP polypeptide or a biologically active section thereof may be functionally linked to a non-MCP polypeptide to produce a fusion protein. In preferred embodiments, this fusion protein has a different activity from that of the MCP protein alone. In other preferred embodiments, said fusion protein is capable of modulating the yield, production and/or efficiency of production of one or more fine chemicals in C. glutamicum or of serving as identification marker for C. glutamicum or related organisms. In particularly preferred embodiments, integration of said fusion protein into a host cell modulates the production of a compound of interest by the cell.

Another aspect of the invention relates to a method for preparing a fine chemical. The method provides for the cultivation of a cell containing a vector which causes the expression of an MCP nucleic acid molecule of the invention so that a fine chemical is produced. In a preferred embodiment, this method additionally comprises the step of obtaining a cell containing such a vector, said cell being transfected with a vector which causes expression of an MCP nucleic acid. In a further preferred embodiment, said method additionally comprises the step in which the fine chemical is obtained from the culture. In a particularly preferred embodiment the cell belongs to the genus Corynebacterium or Brevibacterium.

Another aspect of the invention relates to method for modulating the production of a molecule from a microorganism. These methods comprise contacting the cell with a substance which modulates the MCP-protein activity or the expression of MCP nucleic acids such that a cell-associated activity is modified in comparison with the same activity in the absence of said substance. In a preferred embodiment, the cell is modulated with respect to one or more C. glutamicum MCP-protein activities so that the yield, production and/or efficiency of production of a fine chemical of interest by said microorganism is improved. The substance which modulates the MCP-protein activity may be a substance which stimulates MCP-protein activity or expression of MCP nucleic acids. Examples of substances stimulating said MCP-protein activity or expression of MCP nucleic acids include small molecules, active MCP proteins and nucleic acids which encode MCP proteins and have been introduced into the cell. Examples of substances which inhibit MCP activity or MCP expression include small molecules and antisense MCP nucleic acid molecules.

Another aspect of the invention relates to methods for modulating the yields, the production and/or the efficiency of production of a compound of interest from a cell, comprising introducing an MCP wild-type gene or MCP-mutant gene into a cell, which gene either remains on a separate plasmid or is integrated into the genome of the host cell. Integration into the genome may take place randomly or via homologous recombination, so that the native gene is replaced by the integrated copy leading to the production of the compound of interest from the cell to be modulated. In a preferred embodiment, said yields are increased. In another preferred embodiment, the chemical is a fine chemical and is, in a particularly preferred embodiment, an amino acid. In a particularly preferred embodiment, this amino acid is L-lysine.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides MCP nucleic-acid and MCP-protein molecules which can be used for identifying Corynebacterium glutamicum or related organisms, for mapping the C. glutamicum genome (or the genome of a closely related organism) or for identifying microorganisms which can be used for producing fine chemicals, for example via fermentative processes. The proteins encoded by said nucleic acids may be used for directly or indirectly modulating the production or the efficiency of production of one or more fine chemicals in C. glutamicum, as identification markers for C. glutamicum or related organisms, for oxidizing terpenoids or for degrading hydrocarbons or as targets for developing therapeutic pharmaceutical compounds. The aspects of the invention are further illustrated below.

I. Fine chemicals

The term “fine chemicals” is known in the art and includes molecules which are produced by an organism and are used in various branches of industry such as, for example, but not restricted to, the pharmaceutical industry, the agricultural industry and the cosmetics industry. These compounds comprise organic acids such as tartaric acid, itaconic acid and diaminopimelic acid, both proteinogenic and nonproteinogenic amino acids, purine and pyrimidine bases, nucleosides and nucleotides (as described, for example, in Kuninaka, A. (1996) Nucleotides and related compounds, pp. 561-612, in Biotechnology Vol. 6, Rehm et al., Editors VCH: Weinheim and the references therein), lipids, saturated and unsaturated fatty acids (for example arachidonic acid), diols (e.g. propanediol and butanediol), carbohydrates (e.g. hyaluronic acid and trehalose), aromatic compounds (e.g. aromatic amines, vanillin and indigo), vitamins and cofactors (as described in Ullmann's Encyclopedia of Industrial Chemistry, Vol. A27, “Vitamins”, pp. 443-613 (1996) VCH: Weinheim and the references therein; and Ong, A. S., Niki, E. and Packer, L. (1995) “Nutrition, Lipids, Health and Disease” Proceedings of the UNESCO/Confederation of Scientific and Technological Associations in Malaysia and the Society for Free Radical Research—Asia, held Sept. 1-3, 1994, in Penang, Malaysia, AOCS Press (1995)), enzymes and all other chemicals described by Gutcho (1983) in Chemicals by Fermentation, Noyes Data Corporation, ISBN: 0818805086 and the references indicated therein. The metabolism and the uses of particular fine chemicals are further illustrated below.

A. Metabolism and Uses of Amino Acids

Amino acids comprise the fundamental structural units of all proteins and are thus essential for normal cellular functions in all organisms. The term “amino acid” is known in the art. Proteinogenic amino acids, of which there are 20 types, serve as structural units for proteins, in which they are linked together by peptide bonds, whereas the nonproteinogenic amino acids (hundreds of which are known) usually do not occur in proteins (see Ullmann's Encyclopedia of Industrial Chemistry, Vol. A2, pp. 57-97 VCH: Weinheim (1985)). Amino acids can exist in the optical D or L configuration, although L-amino acids are usually the only type found in naturally occurring proteins. Biosynthetic and degradation pathways of each of the 20 proteinogenic amino acids are well characterized both in prokaryotic and eukaryotic cells (see, for example, Stryer, L. Biochemistry, 3^(rd) edition, pp. 578-590 (1988)). The “essential” amino acids (histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan and valine), so called because, owing to the complexity of their biosynthesis, they must usually be taken in with the diet, are converted by simple biosynthetic pathways into the other 11 “nonessential” amino acids (alanine, arginine, asparagine, aspartate, cysteine, glutamate, glutamine, glycine, proline, serine and tyrosine). Higher animals are able to synthesize some of these amino acids but the essential amino acids must be taken in with the food in order that normal protein synthesis takes place.

Apart from their function in protein biosynthesis, these amino acids are interesting chemicals as such, and it has been found that many have various applications in the human food, animal feed, chemicals, cosmetics, agricultural and pharmaceutical industries. Lysine is an important amino acid not only for human nutrition but also for monogastric livestock such as poultry and pigs. Glutamate is most frequently used as flavor additive (monosodium glutamate, MSG) and elsewhere in the food industry, as are aspartate, phenylalanine, glycine and cysteine. Glycine, L-methionine and tryptophan are all used in the pharmaceutical industry. Glutamine, valine, leucine, isoleucine, histidine, arginine, proline, serine and alanine are used in the pharmaceutical industry and the cosmetics industry. Threonine, tryptophan and D/L-methionine are widely used animal feed additives (Leuchtenberger, W. (1996) Amino acids—technical production and use, pp. 466-502 in Rehm et al., (editors) Biotechnology Vol. 6, Chapter 14a, VCH: Weinheim). It has been found that these amino acids are additionally suitable as precursors for synthesizing synthetic amino acids and proteins, such as N-acetylcysteine, S-carboxymethyl-L-cysteine, (S)-5-hydroxytryptophan and other substances described in Ullmann's Encyclopedia of Industrial Chemistry, Vol. A2, pp. 57-97, VCH, Weinheim, 1985.

The biosynthesis of these natural amino acids in organisms able to produce them, for example bacteria, has been well characterized (for a review of bacterial amino acid biosynthesis and its regulation, see Umbarger, H. E. (1978) Ann. Rev. Biochem. 47: 533-606). Glutamate is synthesized by reductive amination of α-ketoglutarate, an intermediate product in the citric acid cycle. Glutamine, proline and arginine are each generated successively from glutamate. The biosynthesis of serine takes place in a three-step process and starts with 3-phosphoglycerate (an intermediate product of glycolysis), and affords this amino acid after oxidation, transamination and hydrolysis steps. Cysteine and glycine are each produced from serine, specifically the former by condensation of homocysteine with serine, and the latter by transfer of the side-chain β-carbon atom to tetrahydrofolate in a reaction catalyzed by serine transhydroxymethylase. Phenylalanine and tyrosine are synthesized from the precursors of the glycolysis and pentose phosphate pathway, and erythrose 4-phosphate and phosphoenolpyruvate in a 9-step biosynthetic pathway which diverges only in the last two steps after the synthesis of prephenate. Tryptophan is likewise produced from these two starting molecules but it is synthesized by an 11-step pathway. Tyrosine can also be prepared from phenylalanine in a reaction catalyzed by phenylalanine hydroxylase. Alanine, valine and leucine are each biosynthetic products derived from pyruvate, the final product of glycolysis. Aspartate is formed from oxalacetate, an intermediate product of the citrate cycle. Asparagine, methionine, threonine and lysine are each produced by the conversion of aspartate. Isoleucine is formed from threonine. Histidine is formed from 5-phosphoribosyl 1-pyrophosphate, an activated sugar, in a complex 9-step pathway.

Amounts of amino acids exceeding those required for protein biosynthesis cannot be stored and are instead broken down so that intermediate products are provided for the principal metabolic pathways in the cell (for a review, see Stryer, L., Biochemistry, 3^(rd) edition, Chapter 21 “Amino Acid Degradation and the Urea Cycle”; pp. 495-516 (1988)). Although the cell is able to convert unwanted amino acids into the useful intermediate products of metabolism, production of amino acids is costly in terms of energy, the precursor molecules and the enzymes necessary for their synthesis. It is therefore not surprising that amino acid biosynthesis is regulated by feedback inhibition, whereby the presence of a particular amino acid slows down or completely stops its own production (for a review of the feedback mechanism in amino acid biosynthetic pathways, see Stryer, L., Biochemistry, 3^(rd) edition, Chapter 24, “Biosynthesis of Amino Acids and Heme”, pp. 575-600 (1988)). The output of a particular amino acid is therefore restricted by the amount of this amino acid in the cell.

B. Metabolism and Uses of Vitamins, Cofactors and Nutraceuticals

Vitamins, cofactors and nutraceuticals comprise another group of molecules. Higher animals have lost the ability to synthesize them and therefore have to take them in, although they are easily synthesized by other organisms such as bacteria. These molecules are either bioactive molecules per se or precursors of bioactive substances which serve as electron carriers or intermediate products in a number of metabolic pathways. Besides their. nutritional value, these compounds also have a significant industrial value as colorants, antioxidants and catalysts or other processing auxiliaries. (For a review of the structure, activity and industrial applications of these compounds, see, for example, Ullmann's Encyclopedia of Industrial Chemistry, “Vitamins”, Vol. A27, pp. 443-613, VCH: Weinheim, 1996). The term “vitamin” is known in the art and comprises nutrients which are required for normal functioning of an organism but cannot be synthesized by this organism itself. The group of vitamins may include cofactors and nutraceutical compounds. The term “cofactor” comprises nonproteinaceous compounds necessary for the appearance of a normal enzymic activity. These compounds may be organic or inorganic; the cofactor molecules of the invention are preferably organic. The term “nutraceutical” comprises food additives which are health-promoting in plants and animals, especially humans. Examples of such molecules are vitamins, antioxidants and likewise certain lipids (e.g. polyunsaturated fatty acids).

The biosynthesis of these molecules in organisms able to produce them, such as bacteria, has been comprehensively characterized (Ullmann's Encyclopedia of Industrial Chemistry, “Vitamins”, Vol. A27, pp. 443-613, VCH: Weinheim, 1996, Michal, G. (1999) Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology, John Wiley & Sons; Ong, A. S., Niki, E. and Packer, L. (1995) “Nutrition, Lipids, Health and Disease” Proceedings of the UNESCO/Confederation of Scientific and Technological Associations in Malaysia and the Society for free Radical Research—Asia, held on Sep. 1-3, 1994, in Penang, Malaysia, AOCS Press, Champaign, Ill. X, 374 S).

Thiamine (vitamin B₁) is formed by chemical coupling of pyrimidine and thiazole units. Riboflavin (vitamin B₂) is synthesized from guanosine 5′-triphosphate (GTP) and ribose 5′-phosphate. Riboflavin in turn is employed for the synthesis of flavin mononucleotide (FMN) and flavin adenine dinucleotide (FAD). The family of compounds together referred to as “vitamin B₆” (for example pyridoxine, pyridoxamine, pyridoxal 5′-phosphate and the commercially used pyridoxine hydrochloride), are all derivatives of the common structural unit 5-hydroxy-6-methylpyridine. Panthothenate (pantothenic acid, R-(+)-N-(2,4-dihydroxy-3,3-dimethyl-1-oxobutyl)-β-alanine) can be prepared either by chemical synthesis or by fermentation. The last steps in pantothenate biosynthesis consist of ATP-driven condensation of β-alanine and pantoic acid. The enzymes responsible for the biosynthetic steps for the conversion into pantoic acid and into β-alanine and for the condensation to pantothenic acid are known. The metabolically active form of pantothenate is coenzyme A whose biosynthesis takes place by 5 enzymatic steps. Pantothenate, pyridoxal 5′-phosphate, cysteine and ATP are the precursors of coenzyme A. These enzymes catalyze not only the formation of pantothenate but also the production of (R)-pantoic acid, (R)-pantolactone, (R)-panthenol (provitamin B₅), pantetheine (and its derivatives) and coenzyme A.

The biosynthesis of biotin from the precursor molecule pimeloyl-CoA in microorganisms has been investigated in detail, and several of the genes involved have been identified. It has emerged that many of the corresponding proteins are involved in the Fe cluster synthesis and belong to the class of nifS proteins. Liponic acid is derived from octanonic acid and serves as coenzyme in energy metabolism where it is a constituent of the pyruvate dehydrogenase complex and of the α-ketoglutarate dehydrogenase complex. Folates are a group of substances all derived from folic acid which in turn is derived from L-glutamic acid, p-aminobenzoic acid and 6-methylpterin. The biosynthesis of folic acid and its derivatives starting from the intermediate products of the biotransformation of guanosine 5′-triphosphate (GTP), L-glutamic acid and p-aminobenzoic acid has been investigated in detail in certain microorganisms.

Corrinoids (such as the cobalamines and, in particular, vitamin B₁₂) and the porphyrins belong to a group of chemicals distinguished by a tetrapyrrole ring system. The biosynthesis of vitamin B₁₂ is so complex that it has not yet been completely characterized, but many of the enzymes and substrates involved are now known. Nicotinic acid (nicotinate) and nicotinamide are pyridine derivatives which are also referred to as “niacin”. Niacin is the precursor of the important coenzymes NAD (nicotinamide adenine dinucleotide) and NADP (nicotinamide adenine dinucleotide phosphate) and their reduced forms.

Production of these compounds on the industrial scale is mostly based on cell-free chemical syntheses, although some of these chemicals, such as riboflavin, vitamin B₆, pantothenate and biotin have likewise been produced by large-scale cultivation of microorganisms only vitamin B₁₂ is, because of the complexity of its synthesis, produced only by fermentation. In vitro processes require a considerable expenditure of materials and time and frequently high costs.

C. Purine, Pyrimidine, Nucleoside and Nucleotide Metabolism and Uses

Genes for purine and pyrimidine metabolism and their corresponding proteins are important aims for the therapy of oncoses and viral infections. The term “purine” or “pyrimidine” comprises nitrogen-containing bases which form part of nucleic acids, coenzymes and nucleotides. The term “nucleotide” encompasses the fundamental structural units of nucleic acid molecules, which comprise a nitrogen-containing base, a pentose sugar (the sugar is ribose in the case of RNA and the sugar is D-deoxyribose in the case of DNA) and phosphoric acid. The term “nucleoside” comprises molecules which serve as precursors of nucleotides but have, in contrast to the nucleotides, no phosphoric acid unit. It is possible to inhibit RNA and DNA synthesis by inhibiting the biosynthesis of these molecules or their mobilization to form nucleic acid molecules; targeted inhibition of this activity in cancer cells allows the ability of tumor cells to divide and replicate to be inhibited. There are also nucleotides which do not form nucleic acid molecules but serve as energy stores (i.e. AMP) or as coenzymes (i.e. FAD and NAD).

Several publications have described the use of these chemicals for these medical indications, the purine and/or pyrimidine etabolism being influenced (for example Christopherson, R. I. and Lyons, S. D. (1990) “Potent inhibitors of de novo pyrimidine and purine biosynthesis as chemotherapeutic agents”, Med. Res. Reviews 10: 505-548). Investigations of enzymes involved in purine and pyrimidine metabolism have concentrated on the development of novel medicaments which can be used, for example, as immunosuppressants or antiproliferative agents (Smith, J. L. “Enzymes in Nucleotide Synthesis” Curr. Opin. Struct. Biol. 5 (1995) 752-757; Biochem. Soc. Transact. 23 (1995) 877-902). However, purine and pyrimidine bases, nucleosides and nucleotides also have other possible uses: as intermediate products in the biosynthesis of various fine chemicals (e.g. thiamine, S-adenosylmethionine, folates or riboflavin), as energy carriers for the cell (for example ATP or GTP) and for chemicals themselves which are ordinarily used as flavor enhancers (for example IMP or GMP) or for many medical applications (see, for example, Kuninaka, A., (1996) “Nucleotides and Related Compounds in Biotechnology” Vol. 6, Rehm et al., editors VCH: Weinheim, pp. 561-612). Enzymes involved in purine, pyrimidine, nucleoside or nucleotide metabolism are also increasingly serving as targets against which chemicals are being developed for crop protection, including fungicides, herbicides and insecticides.

The metabolism of these compounds in bacteria has been characterized (for reviews, see, for example, Zalkin, H. and Dixon, J. E. (1992) “De novo purin nucleotide biosynthesis” in Progress in Nucleic Acids Research and Molecular Biology, Vol. 42, Academic Press, pp. 259-287; and Michal, G. (1999) “Nucleotides and Nucleosides”; Chapter 8 in : Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology, Wiley, N.Y.). Purine metabolism, the object of intensive research, is essential for normal functioning of the cell. Disordered purine metabolism in higher animals may cause severe illnesses, for example gout. Purine nucleotides are synthesized from ribose 5-phosphate by a number of steps via the intermediate compound inosine 5′-phosphate (IMP), leading to the production of guanosine 5′-monophosphate (GMP) or adenosine 5′-monophosphate (AMP), from which the triphosphate forms used as nucleotides can easily be prepared. These compounds are also used as energy stores, so that breakdown thereof provides energy for many different biochemical processes in the cell. Pyrimidine biosynthesis takes place via formation of uridine 5′-monophosphate (UMP) from ribose 5-phosphate. UMP in turn is converted into cytidine 5′-triphosphate (CTP). The deoxy forms of all nucleotides are prepared in a one-step reduction reaction from the diphosphate ribose form of the nucleotide to give the diphosphate deoxyribose form of the nucleotide. After phosphorylation, these molecules can take part in DNA synthesis.

D. Trehalose Metabolism and Uses

Trehalose consists of two glucose molecules linked together by an α,α-1,1 linkage. It is ordinarily used in the food industry as sweetener, as additive for dried or frozen foods and in beverages. However, it is also used in the pharmaceutical industry or in the cosmetics industry and biotechnology industry (see, for example, Nishimoto et al., (1998) U.S. Pat. No. 5,759,610; Singer, M. A. and Lindquist, S. Trends Biotech. 16 (1998) 460-467; Paiva, C. L. A. and Panek, A. D. Biotech Ann. Rev. 2 (1996) 293-314; and Shiosaka, M. J. Japan 172 (1997) 97-102). Trehalose is produced by enzymes of many microorganisms and is naturally released into the surrounding medium from which it can be isolated by methods known in the art.

II. Elements and Methods of the Invention

The present invention is based, at least partially, on the detection of new molecules which are referred to herein as MCP nucleic-acid molecules. These MCP nucleic-acid molecules are suitable not only for identifying C. glutamicum or related bacterial species but also as markers for mapping the C. glutamicum genome and for identifying the bacteria which are suitable for producing fine chemicals, for example via fermentative processes. The present invention is also based, at least partially, on the MCP-protein molecules which are encoded by said MCP nucleic-acid molecules. These MCP molecules are capable of modulating the yield, production and/or efficiency of production of one or more fine chemicals in C. glutamicum, of serving as identification markers for C. glutamicum or related organisms, of degrading hydrocarbons and of serving as targets for developing therapeutic pharmaceutical compounds. In one embodiment, the MCP molecules of the invention are involved directly or indirectly in the metabolic pathway of one or more fine chemicals in C. glutamicum. In a preferred embodiment, the action of the MCP molecules of the invention of being involved indirectly or directly in such metabolic pathways has an effect on the production of a fine chemical of interest by said microorganism. In a particularly preferred embodiment, the activity of the MCP molecules of the invention is modulated such that the C. glutamicum metabolic pathways in which the MCP proteins of the invention are involved are modulated with respect to the efficiency or the output, and this modulates directly or indirectly the production or the efficiency of production of a fine chemical of interest by C. glutamicum.

The term “MCP protein” or “MCP polypeptide” comprises proteins which can modulate the yield, production and/or efficiency of production of one or more fine chemicals in C. glutamicum, degrade hydrocarbons, oxidize terpenoids and serve as target protein for drug screening or drug design or as identification markers for C. glutamicum or related organisms. Examples of MCP proteins comprise those which are encoded by the MCP genes listed in Table 1 and Appendix A. The terms “MCP gene” and “MCP nucleic acid sequence” comprise nucleic acid sequences encoding an MCP protein which comprises a coding region and corresponding untranslated 5′ and 3′ sequence regions. Examples of MCP genes are those listed in Table 1. The terms “production” and “productivity” are known in the art and include concentration of the fermentation product (for example the fine chemical of interest) which is produced within a predetermined time interval and a predetermined fermentation volume (e.g. kg of product per h per 1). The term “efficiency of production” comprises the time required for attaining a particular production quantity (for example, the time required by the cell for reaching a particular throughput rate of a fine chemical). The term “yield” or “product/carbon yield” is known in the art and comprises the efficiency of converting the carbon source into the product (i.e. the fine chemical). This is, for example, usually expressed as kg of product per kg of carbon source. Increasing the yield or production of the compound increases the amount of the molecules obtained or of the suitable obtained molecules of this compound in a particular amount of culture over a predetermined period of time. The terms “biosynthesis” and “biosynthetic pathway” are known in the art and comprise the synthesis of a compound, preferably an organic compound from intermediates by a cell, for example in a multistep process or highly regulated process. The terms “degradation” and “degradation pathway” are known in the art and comprise cleavage of a compound, preferably an organic compound, into degradation products (in more general terms: smaller or less complex molecules) by a cell, for example in a multistep process or highly regulated process. The term “metabolism” is known in the art and comprises the entirety of biochemical reactions which take place in an organism. The metabolism of a particular compound (e.g. the metabolism of an amino acid such as glycine) then comprises all biosynthetic, modification and degradation pathways in the cell, which are related to this compound.

In another embodiment, the MCP molecules of the invention are capable of modulating directly or indirectly the production of a molecule of interest, such as a fine chemical, in a microorganism such as C. glutamicum. Using gene recombination technique it is possible to manipulate one or more MCP proteins of the invention such that their function is modulated. This modulation of the function may lead to the modulation of the yield, production and/or efficiency of production of one or more fine chemicals from C. glutamicum.

For example, it is possible, by modifying the activity of a protein which is involved in the biosynthesis or degradation of a fine chemical (i.e. by mutagenesis of the corresponding gene), to directly modulate the ability of the cell to synthesize or degrade said compound, thereby modulating the yield and/or the efficiency of production of the fine chemical. Likewise it is possible, by modulating the activity of a protein which regulates a fine-chemical metabolic pathway, to influence directly, whether the production of the desired compound is up or down regulated, both of which modulate the yield or efficiency of production of the fine chemical from the cell.

The indirect modulation of the production of fine chemicals may also be carried out by modifying the activity of a protein of the invention (i.e. by mutagenesis of the corresponding gene) so that the ability of the cell to grow and to divide or to remain viable or productive is increased overall. The production of fine chemicals from C. glutamicum is usually achieved by large-scale fermentative culturing of said microorganisms conditions which are frequently suboptimal for growth and cell division. Modifying a protein of the invention (e.g. a stress reaction protein, a cell wall protein or proteins which are involved in the metabolism of compounds which are necessary for cell division and cell growth to take place, such as nucleotides and amino acids) such that survival, growth and propagation under said conditions can be improved, can make it possible to increase the number and the productivity of said modified C. glutamicum cells in large-scale cultures, and this in turn should increase the yields and/or the efficiency of production of one or more of the desired fine chemicals. Furthermore, the metabolic pathways of a cell are necessarily dependent on one another and coregulated. Changing the activity of any metabolic pathway in C. glutamicum (i.e. changing the activity of one of the proteins of the invention, which is involved in such a pathway) makes it possible to change simultaneously the activity or regulation of another metabolic pathway in this microorganism, which may be involved directly in the synthesis or degradation of a fine chemical.

The isolated nucleic acid sequences of the invention are located in the genome of a Corynebacterium glutamicum strain which can be obtained from the American Type Culture Collection under the name ATCC 13032. The nucleotide sequence of the isolated C. glutamicum MCP nucleic-acid molecules and the predicted amino acid sequences of the C. glutamicum MCP proteins are shown in Appendix A and Appendix B, respectively. Computer analyses were carried out which classified and/or identified many of these nucleotide sequences as sequences with homologies to E. coli or Bacillus subtilis genes.

The present invention also relates to proteins whose amino acid sequence is essentially homologous to an amino acid sequence in Appendix B. As used herein, a protein whose amino acid sequence is essentially homologous to a selected amino acid sequence is at least about 50% homologous to the selected amino acid sequence, for example to the entire selected amino acid sequence. A protein whose amino acid sequence is essentially homologous to a selected amino acid sequence may also be at least about 50-60%, preferably at least about 60-70%, more preferably at least about 70-80%, 80-90% or 90-95% and most preferably at least about 96%, 97%, 98%, 99% or even more homologous to the selected amino acid sequence.

An MCP protein of the invention or a biologically active section or fragment thereof is capable of modulating the yield, production and/or efficiency of production of one or more fine chemicals in C. glutamicum, of degrading hydrocarbons, or oxidizing terpenoids, of serving as target for drug development or of serving as identification marker for C. glutamicum or related organisms.

The following subsections describe various aspects of the invention in more detail:

A. Isolated Nucleic Acid Molecules

One aspect of the invention relates to isolated nucleic acid molecules which encode MCP molecules or biologically active sections thereof and also to nucleic acid fragments which are sufficient for the use as hybridization probes or primers for identifying or amplifying MCP-encoding nucleic acids (e.g. MCP DNA). These nucleic acid molecules may be used for identifying C. glutamicum or related organisms, for mapping the genome of C. glutamicum or related organisms or for identifying microorganisms which are suitable for producing fine chemicals, for example via fermentative processes. The term “nucleic acid molecule”, as used herein, is intended to comprise DNA molecules (e.g. cDNA or genomic DNA) and RNA molecules (e.g. mRNA) and also DNA or RNA analogs generated by means of nucleotide analogs.

Moreover, this term comprises the untranslated sequence located at the 3′ and 5′ ends of the coding gene region: at least about 100 nucleotides of the sequence upstream of the 5′ end of the coding region and at least about 20 nucleotides of the sequence downstream of the 3′ end of the coding gene region. The nucleic acid molecule may be single-stranded or double-stranded but is preferably double-stranded DNA. An “isolated” nucleic acid molecule is removed from other nucleic acid molecules which are present in the natural source of the nucleic acid. An “isolated” nucleic acid preferably does not have any sequences which flank the nucleic acid naturally in the genomic DNA of the organism from which the nucleic acid originates (for example, sequences located at the 5′ or 3′ end of the nucleic acid). In various embodiments, the isolated MCP nucleic acid molecule may have, for example, less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of the nucleotide sequences which naturally flank the nucleic acid molecule in the genomic DNA of the cell from which the nucleic acid originates (e.g. a C. glutamicum cell). In addition to this, an “isolated” nucleic acid molecule such as a cDNA molecule may essentially be free of other cellular material or culture medium, if prepared by recombinant techniques, or free of chemical precursors or other chemicals, if chemically synthesized.

A nucleic acid molecule of the invention, for example a nucleic acid molecule having a nucleotide sequence of Appendix A or a section thereof, may be isolated by means of molecular biological standard techniques and by the sequence information provided herein. For example, a C. glutamicum MCP cDNA may be isolated from a C. glutamicum bank by using a complete sequence from Appendix A or a section thereof as hybridization probe and by using standard hybridization techniques (as described, for example, in Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989). Moreover, a nucleic acid molecule comprising a complete sequence from Appendix A or a section thereof can be isolated via polymerase chain reaction, using the oligonucleotide primers produced on the basis of said sequence (for example, it is possible to isolate a nucleic acid molecule comprising a complete sequence from Appendix A or a section thereof via polymerase chain reactions by using oligonucleotide primers which have been produced on the basis of this same sequence from Appendix A). For example, mRNA can be isolated from normal endothelial cells (for example via the guanidinium-thiocyanate extraction method of Chirgwin et al. (1979) Biochemistry 18:5294-5299), and the cDNA can be prepared by means of reverse transcriptase (e.g. Moloney-MLV Reverse Transcriptase, available from Gibco/BRL, Bethesda, Md., or AMV Reverse Transcriptase, available from Seikagaku America, Inc., St. Petersburg, Fla.) and by means of random polynucleotide primers or oligonucleotide primers on the basis of any of the nucleotide sequences shown in Appendix A. Synthetic oligonucleotide primers for amplification via polymerase chain reaction can be produced on the basis of any of the nucleotide sequences shown in Appendix A. A nucleic acid of the invention may be amplified by means of cDNA or, alternatively, genomic DNA as template and suitable oligonucleotide primers according PCR standard amplification techniques. The nucleic acid amplified in this way may be cloned into a suitable vector and characterized by DNA sequence analysis. Oligonucleotides corresponding to an MCP nucleotide sequence may also be prepared by standard syntheses using, for example, an automatic DNA synthesizer.

In a preferred embodiment, an isolated nucleic acid molecule of the invention comprises any of the nucleotide sequences listed in Appendix A. The sequences of Appendix A correspond to the inventive MCP cDNAs from Corynebacterium glutamicum. These cDNAs comprise sequences, the MCP proteins (i.e. the “coding region” which is indicated in each sequence in Appendix A), and also the 5′ and 3′ untranslated sequences which are likewise indicated in Appendix A. As an alternative, the nucleic acid molecule may comprise only the coding region of any of the sequences in Appendix A.

In addition to this, the nucleic acid molecule of the invention may comprise only one section of the coding region of any of the sequences in Appendix A, for example a fragment can be used as probe or primer or fragment which encodes a biologically active section of an MCP protein. The nucleotide sequences determined from cloning the C. glutamicum MCP genes make it possible to generate probes and primers which are designed for identifying and/or cloning MCP homologs in other cell types and organisms and MCP homologs of other Corynebacteria or related species. The probe or the primer usually comprises essentially purified oligonucleotide. The oligonucleotide usually comprises a nucleotide sequence region which hybridizes under stringent conditions to at least about 12, preferably about 25, more preferably about 40, 50 or 75 successive nucleotides of a sense strand of any of the sequences indicated in Appendix A, of an antisense strand of any of the sequences indicated in Appendix A or naturally occurring mutants thereof. Primers on the basis of a nucleotide sequence of Appendix A may be used in PCR reactions for cloning MCP homologs. Probes on the basis of the MCP nucleotide sequences may be used for detecting transcripts or genomic sequenes which encode the same or homologous proteins. In preferred embodiments, the probe moreover comprises a labeling group bound thereto, for example a radioisotope, a fluorescent compound, an enzyme or an enzyme cofactor. These probes may be used as part of a diagnostic assay kit for identifying cells with faulty expression of an MCP protein, for example by measuring a quantity of an MCP-encoding nucleic acid in a cell sample, for example by detecting the MCP mRNA level or by determining whether a genomic MCP gene has been mutated or deleted.

In one embodiment, the nucleic acid molecule of the invention encodes a protein or a section thereof comprising an amino acid sequence which is sufficiently homologous to an amino acid sequence of Appendix B that the protein or a section thereof retains the ability to modulate the yield, production and/or efficiency of production of one or more fine chemicals in C. glutamicum, to degrade hydrocarbons, to oxidize terpenoids, to serve as target for drug development or to serve as identification marker for C. glutamicum or related organisms. The term “sufficiently homologous”, as used herein, relates to proteins or sections thereof whose amino acid sequences have a minimum number of identical or equivalent (for example an amino acid residue having a side chain similar to that of an amino acid residue in any of the sequences of Appendix B) amino acid residues, compared to an amino acid sequence of Appendix B, such that the protein or a section thereof is capable of modulating the yield, production and/or efficiency of production of one or more fine chemicals in C. glutamicum, of degrading hydrocarbons, of oxidizing terpenoids, of serving as target for drug development or of serving as identification marker for C. glutamicum or related organisms. Examples of these activities are likewise described herein. Thus, the “function of an MCP protein” contributes to the overall regulation of the metabolic pathway of one or more fine chemicals or to the degradation of a hydrocarbon or to the oxidation of a terpenoid.

Sections of proteins encoded by the MCP nucleic acid molecules of the invention are preferably biologically active sections of any of the MCP proteins. The term “biologically active section of an MCP protein”, as used herein, is intended to comprise a section, for example a domain or a motif of an MCP protein, which/ which modulates the yield, production and/or efficiency of production of one or more fine chemicals in C. glutamicum, degrades hydrocarbons, oxidizes terpenoids and serves as a target for drug development or as an identification marker for C. glutamicum or related organisms. In order to determine whether an MCP protein or a biologically active section thereof is able to modulate the yield, production and efficiency of production of one or more fine chemicals in C. glutamicum, to degrade hydrocarbons or to oxidize terpenoids, an enzyme activity assay may be carried out. These assay methods, as described in detail in Example 8 of the examples, are familiar to the skilled worker.

Additional nucleic acid fragments encoding biologically active sections of an MCP protein can be prepared by isolating a section of any of the sequences in Appendix B, expressing the encoded section of the MCP protein or MCP peptide (for example by recombinant expression in vitro) and determining the activity of the encoded section of the MCP protein or MCP peptide.

Moreover, the invention comprises nucleic acid molecules which differ from any of the nucleotide sequences shown in Appendix A (and sections thereof) due to the degeneracy of the genetic code and thus encode the same MCP protein as the one encoded by the nucleotide sequences shown in Appendix A. In another embodiment, an isolated nucleic acid molecule of the invention has a nucleotide sequence which encodes a protein having an amino acid sequence shown in Appendix B. In a further embodiment, the nucleic acid molecule of the invention encodes a full-length C. glutamicum protein which is essentially homologous to an amino acid sequence of Appendix B (encoded by an open reading frame shown in Appendix A).

In addition to naturally occurring MCP-sequence variants which may exist in the population the skilled worker likewise understands that changes can be introduced into a nucleotide sequence of Appendix A via mutation, leading to a change in the amino acid sequence of the encoded MCP protein, without impairing the functionality of the MCP protein. Thus it is possible, for example, to prepare in a sequence of Appendix A nucleotide substitutions which lead to amino acid substitutions at “nonessential” amino acid residues. A “nonessential” amino acid residue is a residue which can be altered in the wild-type sequence of any of the MCP proteins (Appendix B) without modifying the activity of said MCP protein, whereas an “essential” amino acid residue is required for MCP-protein activity. However, other amino acid residues (for example nonconserved or merely semiconserved amino acid residues in the domain with MCP activity) may be nonessential for activity and can therefore probably be modified without modifying the MCP activity.

Consequently, another aspect of the invention relates to nucleic acid molecules encoding MCP proteins which contain modified amino acid residues which are nonessential for MCP activity. The amino acid sequence of these MCP proteins differs from a sequence in Appendix B but the proteins nevertheless retain at least one of the MCP activities described herein. In one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding a protein which comprises an amino acid sequence having at least about 50% homology to an amino acid sequence of Appendix B and which is capable of modulating the yield, production and/or efficiency of production of one or more fine chemicals in C. glutamicum, of degrading hydrocarbons, of oxidizing terpenoids, of serving as a target for drug development or of serving as an identification marker for C. glutamicum or related organisms.

An isolated nucleic acid molecule encoding an MCP protein which is homologous to a protein sequence of Appendix B may be generated by introducing one or more nucleotide substitutions, additions or deletions into a nucleotide sequence of Appendix A so that one or more amino acid substitutions, additions or deletions are introduced into the encoded protein. The mutations may be introduced into any of the sequences of Appendix A by standard techniques such as site-directed mutagenesis and PCR-mediated mutagenesis. Preference is given to introducing conservative amino acid substitutions at one or more of the predicted nonessential amino acid residues. A “conservative amino acid substitution” replaces the amino acid residue by an amino acid residue with a similar side chain. Families of amino acid residues with similar side chains have been defined in the art. These families comprise amino acids with basic side chains (e.g. lysine, arginine, histidine), acidic side chains (e.g. aspartic acid, glutamic acid), uncharged polar side chains (e.g. glycine, aspargine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g. alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g. threonine, valine, isoleucine) and aromatic side chains (e.g. tyrosine, phenylalanine, tryptophan, histidine). A predicted nonessential amino acid residue in an MCP protein is thus preferably replaced by another amino acid residue of the same side-chain family. In another embodiment, the mutations may alternatively be introduced randomly over the entire or over part of the MCP-encoding sequence, for example by saturation mutagenesis, and the resultant mutants may be tested for an MCP activity described herein, in order to identify mutants maintaining MCP activity. After mutagenesis of any of the sequences of Appendix A, the encoded protein may be expressed recombinantly, and the activity of said protein may be determined using, for example, the assays described herein (see Example 8 of the examples).

B. Recombinant Expression Vectors and Host Cells

Another aspect of the invention relates to vectors, preferably expression vectors, containing a nucleic acid which encodes an MCP protein (or a section thereof). The term “vector”, as used herein, relates to a nucleic acid molecule capable of transporting another nucleic acid to which it is bound. One type of vector is a “plasmid” which term means a circular double-stranded DNA loop into which additional DNA segments can be ligated. Another type of vector is a viral vector, and here additional DNA segments can be ligated into the viral genome. Certain vectors are capable of replicating autonomously in a host cell into which they have been introduced (for example, bacterial vectors with bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g. nonepisomal mammalian vectors) are integrated into the genome of a host cell when introduced into said host cell and thereby replicated together with the host genome. Moreover, particular vectors are capable of controlling the expression of genes to which they are functionally linked. These vectors are referred to here as “expression vectors”. Normally, expression vectors used in DNA recombination techniques are in the form of plasmids. In the present description, “plasmids” and “vectors” may be used interchangeably, since the plasmid is the most commonly used type of vector. However, the present invention is intended to comprise other types of expression vectors such as viral vectors (for example replication-deficient retroviruses, adenoviruses and adenovirus-related viruses), which exert similar functions.

The recombinant expression vectors of the invention comprise a nucleic acid of the invention in a form which is suitable for expressing said nucleic acid in a host cell, meaning that the recombinant expression vectors comprise one or more regulatory sequences which are selected on the basis of the host cells to be used for expression and which are functionally linked to the nucleic acid sequence to be expressed. In a recombinant expression vector the term “functionally linked” means that the nucleotide sequence of interest is bound to the regulatory sequence(s) such that expression of said nucleotide sequence is possible (for example in an in vitro transcription/translation system or in a host cell, if the vector has been introduced into said host cell). The term “regulatory sequence” is intended to comprise promoters, repressor-binding sites, activator-binding sites, enhancer regions and other expression control elements (e.g. terminators, other elements of the m-RNA secondary structure or polyadenylation signals). These regulatory sequences are described, for example in Goeddel: Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences comprise those which control the constitutive expression of a nucleotide sequence in many types of host cells and those which control expression of the nucleotide sequence only in particular host cells. The skilled worker understands that designing an expression vector may depend on factors such as the choice of host cell to be transformed, the desired extent of protein expression, etc. The expression vectors of the invention may be introduced into the host cells such as to prepare proteins or peptides, including the fusion proteins or fusion peptides which are encoded by the nucleic acids as described herein (e.g. MCP proteins, mutated forms of MCP proteins, fusion proteins, etc.).

The recombinant expression vectors of the invention may be designed for expressing MCP proteins in prokaryotic or eukaryotic cells. For example, MCP genes may be expressed in bacterial cells such as C. glutamicum, insect cells (using Baculovirus expression vectors), yeast cells and other fungal cells (see Romanos, M. A. et al. (1992) “Foreign gene expression in yeast: a review”, Yeast 8: 423-488; van den Hondel, C. A. M. J. J. et al. (1991) “Heterologous gene expression in filamentous fungi” in: More Gene Manipulations in Fungi, J. W. Bennet & L. L. Lasure, Editors, pp. 396-428: Academic Press: San Diego; und van den Hondel, C. A. M. J. J. & Punt, P. J. (1991) “Gene transfer systems and vector development for filamentous fungi” in: Applied Molecular Genetics of Fungi, Peberdy, J. F. et al., Editors, pp. 1-28, Cambridge University Press: Cambridge), algal and multicellular plant cells (see Schmidt, R. and Willmitzer, L. (1988) “High efficiency Agrobacterium tumefaciens-mediated transformation of Arabidopsis thaliana leaf and cotyledon explants” Plant Cell Rep.: 583-586) or mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). Alternatively, the recombinant expression vector may be transcribed and translated in vitro, for example by using T7 promoter regulatory sequences and T7 polymerase.

Proteins are expressed in prokaryotes mainly by using vectors containing constitutive or inducible promoters which control the expression of fusion or nonfusion proteins. Fusion vectors contribute a number of amino acids to a protein encoded therein, usually at the amino terminus of the recombinant protein. These fusion vectors usually have three tasks: 1) enhancing the expression of recombinant protein; 2) increasing the solubility of the recombinant protein; and 3) supporting the purification of the recombinant protein by acting as a ligand in affinity purification. Often a proteolytic cleavage site is introduced into fusion expression vectors at the junction of fusion unit and recombinant protein so that the recombinant protein can be separated from the fusion unit after purifying the fusion protein. These enzymes and their corresponding recognition sequences comprise factor Xa, thrombin and enterokinase.

Common fusion expression vectors comprise pGEX (Pharmacia Biotech Inc; Smith, D. B. and Johnson, K. S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT 5 (Pharmacia, Piscataway, N.J.), in which glutathione S-transferase (GST), maltose E-binding protein and protein A, respectively, are fused to the recombinant target protein. In one embodiment, the MCP protein-encoding sequence is cloned into a pGEX expression vector such that a vector is generated which encodes a fusion protein comprising, from N terminus to C terminus: GST—thrombin cleavage site—protein X. The fusion protein may be purified via affinity chromatography by means of a glutathione-agarose resin. The recombinant MCP protein which is not fused to GST may be obtained by cleaving the fusion protein with thrombin.

Examples of suitable inducible nonfusion E. coli expression vectors include pTrc (Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al. Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 60-89). Target-gene expression from the pTrc vector is based on transcription from a hybrid trip-lac fusion promoter by host RNA polymerase. Target-gene expression from the pET 11d vector is based on transcription from a T7-gn10-lac fusion promoter, which is mediated by a coexpressed viral RNA polymerase (T7 gn1). This viral polymerase is provided in the BL 21 (DE3) or HMS174 (DE3) host strain by a resident λ prophage which harbors a T7 gnl gene under the transcriptional control of the lacUV 5 promoter.

One strategy for maximizing expression of the recombinant protein is to express said protein in a host bacterium whose ability to proteolytically cleave said recombinant protein is disrupted (Gottesman, S. Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 119-128). Another strategy is to modify the nucleic acid sequence of the nucleic acid to be inserted into an expression vector such that the individual codons for each amino acid are those which are preferably used in a bacterium selected for expression, such as C. glutamicum (Wada et al. (1992) Nucleic Acids Res. 20:2111-2118). This modification of the nucleic acid sequences of the invention can be carried out by standard techniques of DNA synthesis.

In a further embodiment, the MCP-protein expression vector is a yeast expression vector. Examples of vectors for expression in the yeast S. cerevisiae include pYepSec1 (Baldari et al., (1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz (1982) Cell 30:933-943), pJRY88 (Schultz et al. (1987) Gene 54:113-123) and pYES2 (Invitrogen Corporation, San Diego, Calif.). Vectors and methods for constructing vectors which are suitable for use in other fungi such as filamentous fungi include those which are described in detail in: van den Hondel, C. A. M. J. J. & Punt, P. J. (1991) “Gene transfer systems and vector development for filamentous fungi, in: Applied Molecular Genetics of Fungi, J. F. Peberdy et al., Editors, pp. 1-28, Cambridge University Press: Cambridge.

As an alternative, it is possible to express the MCP proteins of the invention in insect cells using Baculovirus expression vectors. Baculovirus vectors available for the expression of proteins in cultured insect cells (e.g. Sf9 cells) include the pAc series (Smith et al., (1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170:31-39).

In a further embodiment, the MCP proteins of the invention may be expressed in unicellular plant cells (such as algae) or in cells of higher plants (e.g. spermatophytes such as crops). Examples of plant expression vectors include those which are described in detail in Becker, D., Kemper, E., Schell, J. and Masterson, R. (1992) “New plant binary vectors with selectable markers located proximal to the left border”, Plant Mol. Biol. 20:1195-1197; and Bevan, M. W. (1984) “Binary Agrobacterium vectors for plant transformation”, Nucl. Acids Res. 12:8711-8721.

In another embodiment, a nucleic acid of the invention is expressed in mammalian cells using a mammalian expression vector. Examples of mammalian expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC (Kaufman et al. (1987) EMBO J. 6:187-195). When used in mammalian cells, the control functions of the expression vector are often provided by viral regulatory elements. Commonly used promoters are derived, for example, from polyoma, adenovirus 2, cytomegalovirus and simian virus 40. Other suitable expression systems for prokaryotic and eukaryotic cells can be found in Chapters 16 and 17 of Sambrook, J., Fritsch, E. F. and Maniatis, T., Molecular cloning: A Laboratory Manual, 2^(nd) Edition, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.

In a further embodiment, the recombinant mammalian expression vector may cause expression of the nucleic acid preferably in a particular cell type (for example, tissue-specific regulatory elements are used for expressing the nucleic acid). Tissue-specific regulatory elements are known in the art. Nonlimiting examples of suitable tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. (1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) Adv. Immunol. 43:235-275), in particular promoters of T-cell receptors (Winoto and Baltimore (1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters (e.g. neurofilament promoter; Byrne and Ruddle (1989) PNAS 86:5473-5477), pancreas-specific promoters (Edlund et al., (1985) Science 230:912-916) and mamma-specific promoters (e.g. milk serum promoter; U.S. Pat. No. 4,873,316 and European patent application document No. 264 166). Development-regulated promoters, for example the murine hox promoters (Kessel and Gruss (1990) Science 249:374-379) and the α-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 3:537-546), are likewise included.

Moreover, the invention provides a recombinant expression vector comprising an inventive DNA molecule which has been cloned into the expression vector in antisense direction. This means that the DNA molecule is functionally linked to a regulatory sequence such that an RNA molecule which is antisense to MCP mRNA can be expressed (via transcription of the DNA molecule). It is possible to select regulatory sequenes which are functionally bound to a nucleic acid cloned in antisense direction and which control the continuous expression of the antisense RNA molecule in a multiplicity of cell types; for example, it is possible to select viral promoters and/or enhancers or regulatory sequences which control the constitutive tissue-specific or cell type-specific expression of antisense RNA. The antisense expression vector may be in the form of a recombinant plasmid, phagemid or attenuated virus, which produces antisense nucleic acids under the control of a highly effective regulatory region whose activity is determined by the cell type into which the vector is introduced. The regulation of gene expression by means of antisense genes is discussed in Weintraub, H. et al., Antisense-RNA as a molecular tool for genetic analysis, Reviews—Trends in Genetics, Vol. 1(1) 1986.

Another aspect of the invention relates to the host cells into which a recombinant expression vector of the invention has been introduced. The terms “host cell” and “recombinant host cell” are used interchangeably herein. Naturally, these terms relate not only to a particular target cell but also the progeny or potential progeny of this cell. Since particular modifications may appear in successive generations, due to mutation or environmental factors, this progeny is not necessarily identical to the parental cell but is still included in the scope of the term as used herein.

A host cell may be a prokaryotic or eukaryotic cell. For example, an MCP protein may be expressed in bacterial cells such as C. glutamicum, insect cells, yeast or mammalian cells (such as Chinese hamster ovary (CHO) cells or COS cells). Other suitable host cells are familiar to the skilled worker. Microorganisms which are related to Corynebacterium glutamicum and can be used in a suitable manner as host cells for the nucleic acid molecules and protein molecules of the invention are listed in Table 3.

Conventional transformation or transfection methods can be used to introduce vector DNA into prokaryotic or eukaryotic cells. The terms “transformation” and “transfection”, “conjugation” and “transduction”, as used herein, are intended to comprise a multiplicity of methods known in the art for introducing foreign nucleic acid (e.g. DNA) into a host cell, including natural competence, chemically mediated transfer, calcium phosphate or calcium chloride coprecipitation, DEAE-dextran-mediated transfection, lipofection or electroporation. Suitable methods for transformation or transfection of host cells can be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual. 2^(nd) Edition, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989) and other laboratory manuals.

In the case of stable transfection of mammalian cells, it is known that, depending on the expression vector used and transfection technique used, only a small proportion of the cells can integrate the foreign DNA into their genome. These integrants are usually identified and selected by introducing a gene which encodes a selectable marker (e.g. resistance to antibiotics) together with the gene of interest into the host cells. Preferred selectable markers include those which impart resistance to drugs such as G418, hygromycin and methotrexate. A nucleic acid which encodes a selectable marker may be introduced into a host cell on the same vector that encodes an MCP protein or may be introduced on a separate vector. Cells which have been stably transfected with the introduced nucleic acid may be identified for example by drug selection (for example, cells which have integrated the selectable marker survive, whereas the other cells die).

A homologous recombined microorganism is generated by preparing a vector which contains at least one MCP-gene section into which a deletion, addition or substitution has been introduced in order to modify, for example functionally disrupt, the MCP gene. This MCP gene is preferably a Corynebacterium glutamicum MCP gene, but it is also possible to use a homolog from a related bacterium or even from a mammalian, yeast or insect source. In a preferred embodiment, the vector is designed such that homologous recombination functionally disrupts the endogeneous MCP gene (i.e. the gene no longer encodes a functional protein; likewise referred to as “knockout” vector). As an alternative, the vector may be designed such that homologous recombination mutates or otherwise modifies the endogenous MCP gene which, however, still encodes the functional protein (for example, the regulatory region located upstream may be modified such that thereby the expression of the endogenous MCP protein is modified). The modified MCP-gene section in the homologous recombination vector is flanked at its 5′ and 3′ ends by additional nucleic acid of the MCP gene, which makes possible a homologous recombination between the exogenous MCP gene carried by the vector and an endogenous MCP gene in a microorganism. The length of the additional flanking MCP nucleic acid is sufficient for a successful homologous recombination with the endogenous gene. Usually, the vector contains less than one kilobase of flanking DNA (both at the 5′ and the 3′ ends) (see, for example, Thomas, K. R. and Capecchi, M. R. (1987) Cell 51:503 for a description of homologous recombination vectors). The vector is introduced into a microorganism (e.g. by electroporation) and cells in which the introduced MCP gene has homologously recombined with the endogenous MCP gene are selected using methods known in the art.

In another embodiment, it is possible to produce recombinant microorganisms which contain selected systems which make possible a regulated expression of the introduced gene. The insertion of an MCP gene into a vector brings it under the control of the lac operon and thus enables, for example, MCP-gene expression only in the presence of IPTG. These regulatory systems are known in the art.

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, may be used for producing (i.e. expressing) an MCP protein. Moreover, the invention provides methods for producing MCP proteins by using the host cells of the invention.

In one embodiment, the method comprises cultivation of the host cell of the invention (into which a recombinant expression vector encoding an MCP protein has been introduced or in whose genome a gene encoding a wild-type or modified MCP protein has been introduced) in a suitable medium until the MCP protein has been produced. In a further embodiment, the method comprises isolating the MCP proteins from the medium or the host cell.

C. Uses and Methods of the Invention

The nucleic acid molecules, proteins, protein homologs, fusion proteins, primers, vectors and host cells described herein may be used in one or more of the following methods: identification of C. glutamicum and related organisms, mapping of genomes of organisms related to C. glutamicum, identification and localization of C. glutamicum sequences of interest, evolutionary studies, determination of MCP-protein regions required for function, modulation of the activity of an MCP protein; modulation of the activity of one or more metabolic pathways and modulation of the cellular production of a compound of interest, such as a fine chemical. The MCP nucleic acid molecules of the invention have a multiplicity of uses. First, they may be used for identifying an organism as Corynebacterium glutamicum or close relatives thereof. They may also be used for identifying C. glutamicum or a relative thereof in a mixed population of microorganisms. The invention provides the nucleic acid sequences of a number of C. glutamicum genes. Probing the extracted genomic DNA of a culture of a uniform or mixed population of microorganisms under stringent conditions with a probe which comprises a region of a C. glutamicum gene which is unique in this organism makes it possible to determine whether said organism is present. Although Corynebacterium glutamicum itself is nonpathogenic, it is related to pathogenic species such as Corynebacterium diptheriae. The detection of such an organism is of substantial clinical importance.

C. glutamicum can be detected in a sample by using techniques known in the art. In particular, it is possible first to cultivate the cells in the sample in a suitable liquid or on a suitable solid culture medium in order to increase the number of cells in the culture. These cells are lysed and the entire DNA contained therein is extracted and, where appropriate, purified in order to remove cell debris and protein material which could interfere with the subsequent analysis. A polymerase chain reaction or a similar technique known in the art is carried out (for a general review of methodologies usually used for nucleic acid sequence amplification, see Mullis et al., U.S. Pat. No. 4,683,195, Mullis et al., U.S. Pat. No. 4,965,188 and Innis, M. A., und Gelfand, D. H. (1989) PCR-Protocols, A guide to Methods and Applications, Academic Press, pp. 3-12, and (1988) Biotechnology 6:1197, and International patent application No. WO89/01050), with primers specific for an MCP nucleic acid molecule of the invention being incubated with the nucleic acid sample such that said particular MCP nucleic acid sequence is amplified, if present in the sample. The particular nucleic acid sequence to be amplified is selected on the basis of its exclusive presence in the genome of C. glutamicum and only a few closely related bacteria. The presence of the desired amplification product indicates the presence of C. glutamicum or of an organism closely related to C. glutamicum.

Furthermore, the nucleic acid and protein molecules of the invention may serve as markers for specific regions of the genome. Using techniques known in the art, it is possible to detect the physical location of the MCP nucleic acid molecules of the invention on the C. glutamicum genome, and this in turn can be used for easier location of other nucleic acid molecules and genes on the map. Moreover, the nucleic acid molecules of the invention may be sufficiently homologous to the sequences of related species that these nucleic acid molecules may likewise enable construction of a genomic map in such bacteria (e.g. Brevibacterium lactofermentum).

The nucleic acid and protein molecules of the invention are suitable not only for mapping the genome but also for functional studies of C. glutamicum proteins. The genomic region to which a particular C. glutamicum DNA-binding protein binds may be identified, for example, by cleaving the C. glutamicum genome and incubating the fragments with the DNA-binding protein. Those fragments which bind the protein may additionally be probed with the nucleic acid molecules of the invention, preferably by using readily detectable lables; binding of such a nucleic acid molecule to the genomic fragment makes it possible to locate the fragment on the genomic map of C. glutamicum, and carrying out this process several times using different enzymes facilitates rapid determination of the nucleic acid sequence to which the protein binds.

The MCP nucleic acid molecules of the invention are likewise suitable for evolutionary studies and protein structure studies. Many prokaryotic and eukaryotic cells utilize the metabolic processes in which the molecules of the invention are involved; by comparing the sequences of the nucleic acid molecules of the invention with those sequences which encode similar enzymes from other organisms, it is possible to determine the degree of evolutionary relationship of said organisms. Accordingly, such a comparison makes it possible to determine which sequence regions are conserved and which are not, and this may be helpful in determining those regions of the protein which are essential for enzyme function. This type of determination is valuable for protein engineering studies and may give an indication as to which level the protein can tolerate mutagenesis without losing its function.

The MCP proteins of the invention can be used as markers for classifying an unknown bacterium as C. glutamicum or for identifying C. glutamicum or closely related bacteria in a sample. Using techniques known in the art, it is possible, for example, to amplify, where appropriate, cells in a sample (for example by cultivation in a suitable medium) in order to increase the sample size, and then to lyse said cells such that the proteins contained therein are released. This sample may be purified, where appropriate, in order to remove cell debris and nucleic acid molecules which could interfere with the subsequent analysis. Antibodies specific for a selected MCP protein of the invention may be incubated with the protein sample in a typical Western assay format (see, for example, Ausubel et al., (1988) Current Protocols in Molecular Biology, Wiley: N.Y.), with the antibody binding to its target protein if said protein is present in the sample. An MCP protein is selected for this type of assay if it is unique or nearly unique for C. glutamicum or for C. glutamicum and very closely related bacteria. The proteins in the sample are then fractionated via gel electrophoresis and transferred to a suitable matrix such as nitrocellulose. A suitable secondary antibody with a detectable label (e.g. chemiluminescent or colorimetric) is incubated with the matrix, followed by stringent washing. The presence or absence of the label indicates the presence or absence of the target protein in the sample. If the protein is present, then this shows the presence of C. glutamicum. A similar method makes possible the classification of an unknown bacterium as C. glutamicum; if a number of C. glutamicum-specific proteins is not detected in the protein samples prepared from the unknown bacterium, then this bacterium is probably not C. glutamicum.

Genetic manipulation of the MCP nucleic acid molecules of the invention may cause the production of MCP proteins with functional differences to wild-type MCP proteins. These proteins can be improved with respect to their efficiency or activity, they can be present in the cell in larger numbers than usually or they can be weakened with respect to their efficiency or activity.

These changes in activity can directly modulate the yield, production and/or efficiency of production of one or more fine chemicals in C. glutamicum. For example, it is possible, by modifying the activity of a protein which is involved in the biosynthesis or degradation of a fine chemical (i.e. by mutagenesis of the corresponding gene), to directly modulate the ability of the cell to synthesize or degrade said compound, thereby modulating the yield and/or the efficiency of production of the fine chemical. Likewise it is possible, by modulating the activity of a protein which regulates a fine-chemical metabolic pathway, to influence directly, whether the production of the desired compound is up or down regulated, both of which modulate the yield or efficiency of production of the fine chemical from the cell.

The indirect modulation of the production of fine chemicals may also be carried out by modifying the activity of a protein of the invention (i.e. by mutagenesis of the corresponding gene) so that the ability of the cell to grow and to divide or to remain viable or productive is increased overall. The production of fine chemicals from C. glutamicum is usually achieved by large-scale fermentative culturing of said microorganisms conditions which are frequently suboptimal for growth and cell division. Modifying a protein of the invention (e.g. a stress reaction protein, a cell wall protein or proteins which are involved in the metabolism of compounds which are necessary for cell division and cell growth to take place, such as nucleotides and amino acids) such that survival, growth and propagation under said conditions can be improved, can make it possible to increase the number and the productivity of said modified C. glutamicum cells in large-scale cultures, and this in turn should increase the yields and/or the efficiency of production of one or more of the desired fine chemicals. Furthermore, the metabolic pathways of a cell are necessarily dependent on one another and coregulated. Changing the activity of any metabolic pathway in C. glutamicum (i.e. changing the activity of one of the proteins of the invention, which is involved in such a pathway) makes it possible to change simultaneously the activity or regulation of another metabolic pathway in this microorganism, which may be involved directly in the synthesis or degradation of a fine chemical.

These abovementioned strategies for the mutagenesis of MCP proteins, which ought to increase the yields of a fine chemical from C. glutamicum, are not intended to be limiting; variations of these mutagenesis strategies are quite obvious to the skilled worker. Using these strategies and including the mechanisms disclosed herein, it is possible to use the nucleic acid and protein molecules of the invention in order to generate C. glutamicum or related bacterial strains expressing mutated MCP nucleic acid and protein molecules so as to improve the yield, production and/or efficiency of production of a compound of interest. The compound of interest may be any product produced by C. glutamicum, including the end products of biosynthetic pathways and intermediates of naturally occurring metabolic pathways and also molecules which do not naturally occur in the C. glutamicum metabolism but are produced by a C. glutamicum strain of the invention.

The following examples which are not to be understood as being limiting further illustrate the present invention. The contents of all references, patent applications, patents and published patent applications cited in this patent application are hereby incorporated by way of reference.

EXAMPLES Example 1 Preparation of Total Genomic DNA from Corynebacterium glutamicum ATCC13032

A Corynebacterium glutamicum (ATCC 13032) culture was cultivated with vigorous shaking in BHI medium (Difco) at 30° C. overnight. The cells were harvested by centrifugation, the supernatant was discarded and the cells were resuspended in 5 ml of buffer I (5% of the original culture volume—all volumes stated have been calculated for a culture volume of 100 ml). Composition of buffer I: 140.34 g/l sucrose, 2.46 g/l MgSO₄.7H₂O, 10 ml/l KH₂PO₄ solution (100 g/l, adjusted to pH 6.7 with KOH), 50 ml/l M12 concentrate (10 g/l (NH₄)₂SO₄, 1 g/l NaCl, 2 g/l MgSO₄.7H₂O, 0.2 g/l CaCl₂, 0.5 g/l yeast extract (Difco), 10 ml/l trace element mixture (200 mg/l FeSO₄.H₂O, 10 mg/l ZnSO₄.7H₂O, 3 mg/l MnCl₂.4H₂O, 30 mg/l H₃BO₃, 20 mg/l CoCl2.6H₂O, 1 mg/l NiCl₂.6H₂O, 3 mg/l Na₂MoO₄.2H₂O), 500 mg/l complexing agents (EDTA or citric acid), 100 ml/l vitamin mixture (0.2 ml/l biotin, 0.2 mg/l folic acid, 20 mg/l p-aminobenzoic acid, 20 mg/l riboflavin, 40 mg/l Ca panthothenate, 140 mg/l nicotinic acid, 40 mg/l pyridoxol hydrochloride, 200 mg/l myoinositol). Lysozyme was added to the suspension at a final concentration of 2.5 mg/ml. After incubation at 37° C. for approx. 4 h, the cell wall was degraded and the protoplasts obtained were harvested by centrifugation. The pellet was washed once with 5 ml of buffer I and once with 5 ml of TE buffer (10 mM Tris-HCl, 1 mM EDTA, pH value 8). The pellet was resuspended in 4 ml of TE buffer and 0.5 ml of SDS solution (10%) and 0.5 ml of NaCl solution (5 M) were added. After addition of proteinase K at a final concentration of 200 μg/ml, the suspension was incubated at 37° C. for approx. 18 hours. The DNA was purified via extraction with phenol, phenol/chloroform/isoamyl alcohol and chloroform/isoamyl alcohol by means of standard methods. The DNA was then precipitated by addition of {fraction (1/50)} volume of 3 M sodium acetate and 2 volumes of ethanol, subsequent incubation at −20° C. for 30 min and centrifugation at 12 000 rpm in a high-speed centrifuge using an SS34 rotor (Sorvall) for 30 min. The DNA was dissolved in 1 ml of TE buffer containing 20 μg/ml RNase A and dialyed against 1000 ml of TE buffer at 4° C. for at least 3 h. The buffer was exchanged 3 times during this period. 0.4 ml of 2 M LiCl and 0.8 ml of ethanol were added to 0.4 ml aliquots of the dialyed DNA solution. After incubation at −20° C. for 30 min, the DNA was collected by centrifugation (13 000 rpm, Biofuge Fresco, Heraeus, Hanau, Germany). The DNA pellet was dissolved in TE buffer. It was possible to use DNA prepared by this method for all purposes, including Southern blotting and constructing genomic banks.

Example 2 Construction of Genomic Corynebacterium glutamicum (ATCC13032) Banks in Escherichia coli

Starting from DNA prepared as described in Example 1, cosmid and plasmid banks were prepared according to known and well-established methods (see, for example, Sambrook, J. et al. (1989) “Molecular Cloning: A Laboratory Manual”. Cold Spring Harbor Laboratory Press or Ausubel, F. M. et al. (1994) “Current Protocols in Molecular Biology”, John Wiley & Sons).

It was possible to use any plasmid or cosmid. Particular preference was given to using the plasmids pBR322 (Sutcliffe, J. G. (1979) Proc. Natl Acad. Sci. USA, 75: 3737-3741); pACYC177 (Change & Cohen (1978) J. Bacteriol. 134: 1141-1156); pBS series plasmids (pBSSK+, PBSSK− and others; Stratagene, LaJolla, USA) or cosmids such as SuperCosl (Stratagene, LaJolla, USA) or Lorist6 (Gibson, T. J. Rosenthal, A., and Waterson, R. H. (1987) Gene 53: 283-286.

Example 3 DNA Sequencing and Functional Computer Analysis

Genomic banks, as described in Example 2, were used for DNA sequencing according to standard methods, in particular the chain termination method using ABI377 sequencers (see, for example, Fleischman, R. D. et al. (1995) “Whole-genome Random Sequencing and Assembly of Haemophilus Influenzae Rd., Science 269; 496-512). Sequencing primers having the following nucleotide sequences were used: 5′-GGAAACAGTATGACCATG-3′or 5′-GTAAAACGACGGCCAGT-3′.

Example 4 In Vivo Mutagenesis

In vivo mutagenesis of Corynebacterium glutamicum may be carried out by passing a plasmid (or other vector) DNA through E. coli or other microorganisms (e.g. Bacillus spp. or yeasts such as Saccharomyces cerevisiae) which cannot maintain the integrity of their genetic information. Common mutator strains contain mutations in the genes for the DNA repair system (e.g., mutHLS, mutD, mutT, etc., for comparison see Rupp, W. D. (1996) DNA repair mechanisms in Escherichia coli and Salmonella, pp. 2277-2294, ASM: Washington). These strains are known to the skilled worker. The use of these strains is illustrated, for example, in Greener, A. and Callahan, M. (1994) Strategies 7; 32-34.

Example 5 DNA Transfer Between Escherichia coli and Corynebacterium glutamicum

A plurality of Corynebacterium and Brevibacterium species contain endogenous plasmids (such as, for example, pHM1519 or PBL1) which replicate autonomously (for a review see, for example, Martin, J. F. et al. (1987) Biotechnology 5: 137-146). Shuttle vectors for Escherichia coli and Corynebacterium glutamicum can be constructed readily by means of standard vectors for E. coli (Sambrook, J. et al., (1989), “Molecular Cloning: A Laboratory Manual”, Cold Spring Harbor Laboratory Press or Ausubel, F. M. et al. (1994) “Current Protocols in Molecular Biology”, John Wiley & Sons), to which an origin of replication for and a suitable marker from Corynebacterium glutamicum is added. Such origins of replication are preferably taken from endogenous plasmids which have been isolated from Corynebacterium and Brevibacterium species. Particular use as transformation markers for these species are genes for kanamycin resistance (such as those derived from the Tn5 or the Tn903 transposon) or for chloramphenicol (Winnacker, E. L. (1987) “From Genes to Clones—Introduction to Gene Technology, VCH, Weinheim). There are numerous examples in the literature for preparing a large multiplicity of shuttle vectors which are replicated in E. coli and C. glutamicum and which can be used for various purposes, including the overexpression of genes (see, for example, Yoshihama, M. et al. (1985) J. Bacteriol. 162: 591-597, Martin, J. F. et al., (1987) Biotechnology, 5: 137-146 and Eikmanns, B. J. et al. (1992) Gene 102: 93-98).

Standard methods make it possible to clone a gene of interest into one of the above-described shuttle vectors and to introduce such hybrid vectors into Corynebacterium glutamicum strains. C. glutamicum can be transformed via protoplast transformation (Kastsumata, R. et al., (1984) J. Bacteriol. 159, 306-311), electroporation (Liebl, E. et al., (1989) FEMS Microbiol. Letters, 53: 399-303) and, in cases in which specific vectors are used, also via conjugation (as described, for example, in Schafer, A., et al. (1990) J. Bacteriol. 172: 1663-1666). Likewise, it is possible to transfer the shuttle vectors for C. glutamicum to E. coli by preparing plasmid DNA from C. glutamicum (by means of standard methods known in the art) and transforming it into E. coli. This transformation step can be carried out using standard methods but advantageously an Mcr-deficient E. coli strain such as NM522 (Gough & Murray (1983) J. Mol. Biol. 166: 1-19) is used.

Example 6 Determination of the Expression of the Mutated Protein

The observations of the activity of a mutated protein in a transformed host cell are based on the fact that the mutated protein is expressed in a similar manner and in similar quantity to the wild-type protein. A suitable method for determining the amount of transcription of the mutated gene (an indication of the amount of mRNA available for translation of the gene product) is to carry out a Northern blot (see, for example, Ausubel et al., (1988) Current Protocols in Molecular Biology, Wiley: N.Y.), with a primer which is designed such that it binds to the gene of interest being provided with a detectable (usually radioactive or chemiluminescent) label such that—when the total RNA of a culture of the organism is extracted, fractionated on a gel, transferred to a stable matrix and incubated with this probe—binding and binding quantity of the probe indicate the presence and also the amount of mRNA for said gene. This information is an indicator for the extent for which the mutated gene has been transcribed. Total cell RNA can be isolated from Corynebacterium glutamicum by various methods known in the art, as described in Bormann, E. R. et al., (1992) Mol. Microbiol. 6: 317-326.

The presence or the relative amount of protein translated from said mRNA can be determined by using standard techniques such as Western blot (see, for example, Ausubel et al. (1988) “Current Protocols in Molecular Biology”, Wiley, N.Y.). In this method, total cell proteins are extracted, separated by gel electrophoresis, transferred to a matrix such as nitrocellulose and incubated with a probe, for example an antibody, which binds specifically to the protein of interest. This probe is usually provided with a chemiluminescent or calorimetric label which can be detected readily. The presence and the observed amount of label indicate the presence and the amount of the desired mutant protein in the cell.

Example 7 Growth of Genetically Modified Corynebacterium glutamicum—Media and Cultivation Conditions

Genetically modified Corynebacteria are cultivated in synthetic or natural growth media. A number of different growth media for Corynebacteria are known and readily available (Lieb et al. (1989) Appl. Microbiol. Biotechnol. 32: 205-210; von der Osten et al. (1998) Biotechnology Letters 11: 11-16; Patent DE 4 120 867; Liebl (1992) “The Genus Corynebacterium”, in: The Procaryotes, Vol. II, Balows, A., et al., editors Springer-Verlag). These media are composed of one or more carbon sources, nitrogen sources, inorganic salts, vitamins and trace elements. Preferred carbon sources are sugars such as mono-, di- or polysaccharides. Examples of very good carbon sources are glucose, fructose, mannose, galactose, ribose, sorbose, ribulose, lactose, maltose, sucrose, raffinose, starch and cellulose. Sugars may also be added to the media via complex compounds such as molasses or other byproducts from sugar refining. It may also be advantageous to add mixtures of various carbon sources. Other possible carbon sources are alcohols and organic acids, such as methanol, ethanol, acetic acid or lactic acid. Nitrogen sources are usually organic or inorganic nitrogen compounds or materials containing these compounds. Examples of nitrogen sources include ammonia gas and ammonium salts such as NH₄Cl or (NH₄)₂SO₄, NH₄OH, nitrates, urea, amino acids and complex nitrogen sources such as cornsteep liquor, soya meal, soya protein, yeast extracts, meat extracts and others.

Inorganic salt compounds which may be present in the media include the chloride, phosphorus or sulfate salts of calcium, magnesium, sodium, cobalt, molybdenum, potassium, manganese, zinc, copper and iron. Chelating agents may be added to the medium in order to keep the metal ions in solution. Particularly suitable chelating agents include dihydroxyphenols such as catechol or protocatechuate and organic acids such as citric acid. The media usually also contain other growth factors such as vitamins or growth promoters, examples of which include biotin, riboflavin, thiamine, folic acid, nicotinic acid, panthothenate and pyridoxine. Growth factors and salts are frequently derived from complex media components such as yeast extract, molasses, cornsteep liquor and the like. The exact composition of the media heavily depends on the particular experiment and is decided upon individually for each case. Information on the optimization of media can be found in the textbook “Applied Microbiol. Physiology, A Practical Approach” (editors P. M. Rhodes, P. F. Stanbury, IRL Press (1997) pp. 53-73, ISBN 0 19 963577 3). Growth media can also be obtained from commercial suppliers, for example Standard 1 (Merck) or BHI (brain heart infusion, DIFCO) and the like.

All media components are sterilized, either by heat (20 min at 1.5 bar and 121° C.) or by sterile filtration. The components may be sterilized either together or, if required, separately. All media components may be present at the start of the cultivation or added continuously or batchwise, as desired.

The cultivation conditions are defined separately for each experiment. The temperature should be between 15° C. and 45° C. and may be kept constant or may be altered during the experiment. The pH of the medium should be in the range from 5 to 8.5, preferably around 7.0 and may be maintained by adding buffers, to the media. An example of a buffer for this purpose is a potassium phosphate buffer. Synthetic buffers such as MOPS, HEPES; ACES, etc. may be used alternatively or simultaneously. Addition of NaOH or NH₄OH can also keep the pH constant during cultivation. If complex media components such as yeast extract are used, the demand for additional buffers decreases, since many complex compounds have a high buffer capacity. In the case of using a fermenter for cultivating microorganisms, the pH may also be regulated using gaseous ammonia.

The incubation period is usually in a range from several hours to several days. This time is selected such that the maximum amount of product accumulates in the broth. The growth experiments disclosed may be carried out in a multiplicity of containers such as microtiter plates, glass tubes, glass flasks or glass or metal fermenters of different sizes. For the screening of a large number of clones, the microorganisms should be grown in microtiter plates, glass tubes or shaker flasks either with or without baffles. Preference is given to using 100 ml shaker flasks which are filled with 10% (based on volume) of the required growth medium. The flasks should be shaken on an orbital shaker (amplitude 25 mm) at a speed in the range of 100-300 rpm. Losses due to evaporation can be reduced by maintaining a humid atmosphere; alternatively, the losses due to evaporation should be corrected mathematically.

If genetically modified clones are investigated, an unmodified control clone or a control clone containing the basic plasmid without insert should also be assayed. The medium is inoculated to an OD₆₀₀ of 0.5-1.5, with cells being used which have been grown on agar plates such as CM plates (10 g/l glucose, 2.5 g/l NaCl, 2 g/l urea, 10 g/l polypeptone, 5 g/l yeast extract, 5 g/l meat extract, 22 g/l agar pH 6.8 with 2 M NaOH) which have been incubated at 30° C. The media are inoculated either by introducing a saline solution of C. glutamicum cells from CM plates or by adding a liquid preculture of said bacterium.

Example 8 In Vitro Analysis of the Function of Mutated Proteins

The determination of the activities and kinetics of enzymes is well known in the art. Experiments for determining the activity of a particular modified enzyme must be adapted to the specific activity of the wild-type enzyme, and this is within the capabilities of the skilled worker. Overviews regarding enzymes in general and also specific details concerning the structure, kinetics, principles, methods, applications and examples of the determination of many enzyme activities can be found, for example, in the following references: Dixon, M., and Webb, E. C: (1979) Enzymes, Longmans, London; Fersht (1985) Enzyme Structure and Mechanism, Freeman, N.Y.; Walsh (1979) Enzymatic Reaction Mechanisms. Freeman, San Francisco; Price, N. C., Stevens, L. (1982) Fundamentals of Enzymology. Oxford Univ. Press: Oxford; Boyer, P. D: editor (1983) The Enzymes, 3rd edition, Academic Press, New York; Bisswanger, H. (1994) Enzymkinetik, 2nd edition VCH, Weinheim (ISBN 3527300325); Bergmeyer, H. U., Bergmeyer, J., Graβ1, M. editors (1983-1986) Methods of Enzymatic Analysis, 3rd edition, Vol. I-XII, Verlag Chemie: Weinheim; and Ullmann's Encyclopedia of Industrial Chemistry (1987) Vol. A9, “Enzymes”, VCH, Weinheim, pp. 352-363.

The activities of proteins binding to DNA can be measured by many well-established methods such as DNA bandshift assays (which are also referred to as gel retardation assays). The action of these proteins on the expression of other molecules can be measured using reporter gene assays (as described in Kolmar, H. et al., (1995) EMBO J. 14: 3895-3904 and in references therein). Reporter gene assay systems are well-known and established for applications in prokaryotic and eukaryotic cells, with enzymes such as beta-galactosidase, green fluorescent protein and several other enzymes being used.

The activity of membrane transport proteins can be determined according to the techniques described in Gennis, R. B. (1989) “Pores, Channels and Transporters”, in Biomembranes, Molecular Structure and Function, Springer: Heidelberg, pp. 85-137; 199-234; and 270-322.

Example 9 Analysis of the Influence of Mutated Protein on the Production of the Product of Interest

The effect of the genetic modification in C. glutamicum on the production of a compound of interest (such as an amino acid) can be determined by growing the modified microorganisms under suitable conditions (such as those described above) and testing the medium and/or the cellular components for increased production of the product of interest (i.e. an amino acid). Such analytical techniques are well known to the skilled worker and include spectroscopy, thin-layer chromatography, various types of coloring methods, enzymic and microbiological methods and analytical chromatography such as high performance liquid chromatography (see, for example, Ullman, Encyclopedia of Industrial Chemistry, Vol. A2, pp. 89-90 and pp. 443-613, VCH: Weinheim (1985); Fallon, A., et al., (1987) “Applications of HPLC in Biochemistry” in: Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 17; Rehm et al. (1993) Biotechnology, Vol. 3, Chapter III: “Product recovery and purification”, pp. 469-714, VCH: Weinheim; Belter, P. A. et al. (1988) Bioseparations: downstream processing for Biotechnology, John Wiley and Sons; Kennedy, J. F. and Cabral, J. M. S. (1992) Recovery processes for biological Materials, John Wiley and Sons; Shaeiwitz, J. A. and Henry, J. D. (1988) Biochemical Separations, in Ullmann's Encyclopedia of Industrial Chemistry, Vol. B3; Chapter 11, pp. 1-27, VCH: Weinheim; and Dechow, F. J. (1989) Separation and purification techniques in biotechnology, Noyes Publications).

In addition to measuring the end product of the fermentation, it is likewise possible to analyze other components of the metabolic pathways, which are used for producing the compound of interest, such as intermediates and byproducts, in order to determine the overall efficiency of production of the compound. The analytical methods include measuring the amounts of nutrients in the medium (for example sugars, hydrocarbons, nitrogen sources, phosphate and other ions), measuring biomass composition and growth, analyzing the production of common metabolites from biosynthetic pathways and measuring gases generated during fermentation. Standard methods for these measurements are described in Applied Microbial Physiology; A Practical Approach, P. M. Rhodes and P. F. Stanbury, editors IRL Press, pp. 103-129; 131-163 and 165-192 (ISBN: 0199635773) and the references therein.

Example 10 Purification of the Product of Interest from a C. glutamicum Culture

The product of interest may be obtained from C. glutamicum cells or from the supernatant of the above-described culture by various methods known in the art. If the product of interest is not secreted by the cells, the cells may be harvested from the culture by slow centrifugation, and the cells may be lysed by standard techniques such as mechanial force or sonication. The cell debris is removed by centrifugation and the supernatant fraction which contains the soluble proteins is obtained for further purification of the compound of interest. If the product is secreted by the C. glutamicum cells, the cells are removed from the culture by slow centrifugation and the supernatant fraction is retained for further purification.

The supernatant fraction from both purification methods is subjected to chromatography using a suitable resin, and either the molecule of interest is retained on the chromatography resin while many contaminants in the sample are not or the contaminants remain on the resin while the sample does not. If necessary, these chromatography steps can be repeated using the same or different chromatography resins. The skilled worker is familiar with the selection of suitable chromatography resins and the most effective application for a particular molecule to be purified. The purified product may be concentrated by filtration or ultrafiltration and stored at a temperature at which product stability is highest.

In the art, many purification methods are known which are not limited to the above purification method and which are described, for example, in Bailey, J. E. & Ollis, D. F. Biochemical Engineering Fundamentals, McGraw-Hill: New York (1986).

The identity and purity of the isolated compounds can be determined by techniques of the art. These techniques comprise high performance liquid chromatography (HPLC), spectroscopic methods, coloring methods, thin-layer chromatography, NIRS, enzyme assays or microbiological assays. These analytical methods are compiled in: Patek et al. (1994) Appl. Environ. Microbiol. 60: 133-140; Malakhova et al. (1996) Biotekhnologiya 11: 27-32; and Schmidt et al. (1998) Bioprocess Engineer. 19: 67-70. Ulmann's Encyclopedia of Industrial Chemistry (1996) Vol. A27, VCH: Weinheim, pp. 89-90, pp. 521-540, pp. 540-547, pp. 559-566, 575-581 and pp. 581-587; Michal, G (1999) Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology, John Wiley and Sons; Fallon, A. et al. (1987) Applications of HPLC in Biochemistry in: Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 17.

Equivalents

The skilled worker knows, or can identify by using simply routine methods a large number of equivalents of the specific embodiments of the invention. These equivalents are intended to be included in the patent claims below.

The information in Table 1 is to be understood as follows:

In column 1, “DNA-ID”, the relevant number refers in each case to the SEQ ID NO of the enclosed sequence listing. Consequently, “5” in column “DNA-ID” is a reference to SEQ ID NO:5.

In column 2, “AA-ID”, the relevant number refers in each case to the SEQ ID NO of the enclosed sequence listing. Consequently, “6” in column “AA-ID” is a reference to SEQ ID NO:6.

In column 3, “Identification”, an unambiguous internal name for each sequence is listed.

In column 4, “AA-POS”, the relevant number refers in each case to the amino acid position of the polypeptide sequence “AA-ID” in the same row. Consequently, “26” in column “AA-POS” is amino acid position 26 of the polypeptide sequence indicated accordingly. Position counting starts at the N terminals with +1.

In column 5, “AA wild-type”, the relevant letter refers in each case to the amino acid, displayed in the one-letter code, at the position in the corresponding wild-type strain, which is indicated in column 4.

In column 6, “AA mutant”, the relevant letter refers in each case to the amino acid, displayed in the one-letter code, at the position in the corresponding mutant strain, which is indicated in column 4.

In column 7, “Function”, the physiological function of the corresponding polypeptide sequence is listed.

One-letter code of the proteinogenic amino acids:

-   A Alanine -   C Cysteine -   D Aspartic acid -   E Glutamic acid -   F Phenylalanine -   G Glycine -   H His -   I Isoleucine -   K Lysine -   L Leucine -   M Methionine -   N Asparagine -   P Proline -   Q Glutamine -   R Arginine -   S Serine -   T Threonine -   V Valine -   W Tryptophan

Y Tyrosine TABLE 1 Genes coding for novel proteins AA DNA AA AA wild AA ID: ID: Identification: pos: type mutant Function: 1 2 RXA00033 158 L F TRANSPORTER 382 G E TRANSPORTER 3 4 RXA00054 613 A V DNA/RNA HELICASE (DEAD/DEAH BOX FAMILY) 5 6 RXA00056 262 G D Hypothetical Exported Protein 7 8 RXA00065 101 S F hypothetical protein 9 10 RXA00068 35 S N hypothetical protein 11 12 RXA00077 374 A T HUNTINGTIN INTERACTING PROTEIN HYPE 13 14 RXA00080 70 G D INTEGRAL MEMBRANE PROTEIN 253 P S INTEGRAL MEMBRANE PROTEIN 15 16 RXA00110 172 S F Hypothetical Membrane Spanning Protein 17 18 RXA00118 53 V I Hypothetical Cytosolic Protein 19 20 RXA00121 164 G D hypothetical protein 21 22 RXA00150 85 G D Hypothetical Membrane Spanning Protein 23 24 RXA00151 110 V I hypothetical protein 25 26 RXA00155 231 P S hypothetical protein 27 28 RXA00185 508 D N Hypothetical Cytosolic Protein 29 30 RXA00197 322 A V MEMBRANE METALLOPROTEASE 31 32 RXA00199 12 P S hypothetical protein 370 L F hypothetical protein 33 34 RXA00211 44 V I GLUTAMYL-TRNA(GLN) AMIDOTRANSFERASE SUBUNIT B (EC 6.3.5.—) 35 36 RXA00220 122 G D ACETYLTRANSFERASE (EC 2.3.1.—) 37 38 RXA00240 8 R K hypothetical protein 39 40 RXA00354 67 G E Hydrolase (HAD superfamily) 41 42 RXA00397 236 A V XAA-PRO AMINOPEPTIDASE (EC 3.4.11.9) 43 44 RXA00409 217 G D TRANSPORTER 45 46 RXA00416 223 A V Hypothetical Membrane Spanning Protein 47 48 RXA00424 4 R K hypothetical protein 49 50 RXA00451 78 S N CYTOSINE DEAMINASE (EC 3.5.4.1) 51 52 RXA00476 274 G E hypothetical protein 287 G R hypothetical protein 53 54 RXA00486 80 R H TRANSCRIPTIONAL REGULATORY PROTEIN, LYSR FAMILY 55 56 RXA00493 363 A T 60 KD CHAPERONIN GROEL 57 58 RXA00496 340 P L hypothetical protein 59 60 RXA00503 186 A T putative sugar kinase 61 62 RXA00507 224 A V LIPASE (EC 3.1.1.3) 63 64 RXA00510 17 G S hypothetical protein 65 66 RXA00550 554 S F PENICILLIN-BINDING PROTEIN 1A 67 68 RXA00552 295 A T THIOSULFATE SULFURTRANSFERASE (EC 2.8.1.1) 69 70 RXA00576 157 E K hypothetical protein 71 72 RXA00611 291 I T TRANSPORTER 73 74 RXA00614 86 P S PHOSPHOESTERASE 75 76 RXA00637 148 E K TRANSCRIPTIONAL REGULATOR, MERR FAMILY 77 78 RXA00654 399 S F INNER MEMBRANE PROTEIN 79 80 RXA00678 127 A T hypothetical protein 81 82 RXA00691 214 A T Hypothetical Cytosolic Protein 83 84 RXA00719 306 D N GTP-BINDING PROTEIN 468 A T GTP-BINDING PROTEIN 85 86 RXA00731 550 A T TRANSPORTER 776 L F TRANSPORTER 87 88 RXA00740 93 A T hypothetical protein 89 90 RXA00812 106 A V INHIBITION OF MORPHOLOGICAL DIFFERENTIATION 91 92 RXA00831 166 A T Hypothetical Cytosolic Protein 93 94 RXA00835 54 A V Hypothetical Membrane Spanning Protein 95 96 RXA00917 635 D N hypothetical protein 97 98 RXA00943 55 P S hypothetical protein 123 L F hypothetical protein 99 100 RXA00963 211 P S 2-HYDROXYHEPTA-2,4-DIENE-1,7-DIOATE ISOMERASE (EC 5.3.3.—)/ 5-CARBOXYMETHYL-2- OXO-HEX-3-ENE-1,7-DIOATE DECARBOXYLASE (EC 4.1.1.—) 101 102 RXA01011 201 A T Hypothetical Cytosolic Protein 103 104 RXA01017 37 V I Hypothetical Cytosolic Protein 105 106 RXA01037 28 G D hypothetical protein 107 108 RXA01046 408 A V ATP-DEPENDENT CLP PROTEASE ATP-BINDING SUBUNIT CLPA 109 110 RXA01109 113 A V hypothetical protein 111 112 RXA01122 104 R C hypothetical protein 113 114 RXA01127 60 D N 3-ISOPROPYLMALATE DEHYDROGENASE (EC 1.1.1.85) 115 116 RXA01128 221 T I TAURINE-BINDING PERIPLASMIC PROTEIN PRECURSOR 117 118 RXA01129 2 V I cyclic nucleotide binding protein/2 CBS domains 119 120 RXA01173 146 A T Hypothetical Secreted Protein 121 122 RXA01174 174 L F Hypothetical Membrane Glycoprotein 123 124 RXA01184 363 R K ABC TRANSPORTER INTEGRAL MEMBRANE PROTEIN 430 G D ABC TRANSPORTER INTEGRAL MEMBRANE PROTEIN 125 126 RXA01196 81 G D Hypothetical Cytosolic Protein 127 128 RXA01263 74 S F O-ANTIGEN ACETYLASE 129 130 RXA01268 150 D N CAPM PROTEIN 131 132 RXA01271 307 S F CAPD PROTEIN 131 132 RXA01271 525 L F CAPD PROTEIN 133 134 RXA01273 106 A V Hypothetical Membrane Associated Protein 135 136 RXA01315 20 A T TRANSCRIPTIONAL REGULATOR, TETR FAMILY 101 G D TRANSCRIPTIONAL REGULATOR, TETR FAMILY 137 138 RXA01318 67 A V putative membrane protein 139 140 RXA01326 103 A V hypothetical protein 141 142 RXA01331 13 T I Hypothetical Membrane Spanning Protein 403 P S Hypothetical Membrane Spanning Protein 544 T I Hypothetical Membrane Spanning Protein 143 144 RXA01400 549 G R Hypothetical Cytosolic Protein 145 146 RXA01425 85 E K 60 KDA INNER MEMBRANE PROTEIN YIDC 147 148 RXA01434 957 S F VIRULENCE FACTOR MVIN 1012 P S VIRULENCE FACTOR MVIN 149 150 RXA01439 116 L F ACETYLTRANSFERASE (EC 2.3.1.—) 151 152 RXA01441 104 C R hypothetical protein 153 154 RXA01448 159 G D INTEGRAL MEMBRANE PROTEIN 155 156 RXA01518 148 A T Hypothetical Membrane Spanning Protein 157 158 RXA01536 121 A V hypothetical protein 308 A V hypothetical protein 375 A T hypothetical protein 159 160 RXA01544 78 V I hypothetical protein 161 162 RXA01548 153 G D hypothetical protein 163 164 RXA01549 31 S G Hypothetical Membrane Spanning Protein 53 A T Hypothetical Membrane Spanning Protein 159 A T Hypothetical Membrane Spanning Protein 165 166 RXA01554 124 G D GLUCAN ENDO-1,3-BETA-GLUCOSIDASE A1 PRECURSOR (EC 3.2.1.39) 486 G R GLUCAN ENDO-1,3-BETA-GLUCOSIDASE A1 PRECURSOR (EC 3.2.1.39) 167 168 RXA01574 123 P S GLYCOSYL TRANSFERASE 169 170 RXA01585 112 V M hypothetical protein Rv2474c 171 172 RXA01590 1432 E K hypothetical protein 173 174 RXA01628 67 D N Hypothetical Membrane Glycoprotein 224 G D Hypothetical Membrane Glycoprotein 175 176 RXA01642 165 G E EXTRACELLULAR SUBTILISIN-LIKE PROTEASE PRECURSOR (EC 3.4.21.—) 294 A T EXTRACELLULAR SUBTILISIN-LIKE PROTEASE PRECURSOR (EC 3.4.21.—) 177 178 RXA01647 112 D N Hypothetical Membrane Associated Protein 167 G S Hypothetical Membrane Associated Protein 179 180 RXA01672 56 G D Hypothetical Membrane Spanning Protein 68 L F Hypothetical Membrane Spanning Protein 181 182 RXA01677 209 G R THIOL:DISULFIDE INTERCHANGE PROTEIN DSBA 183 184 RXA01693 102 A T Hypothetical Cytosolic Protein 185 186 RXA01741 17 D N hypothetical protein 187 188 RXA01749 189 A V INTEGRAL MEMBRANE PROTEIN WITH TRKA DOMAINS 212 P S INTEGRAL MEMBRANE PROTEIN WITH TRKA DOMAINS 420 A T INTEGRAL MEMBRANE PROTEIN WITH TRKA DOMAINS 189 190 RXA01754 596 D N hypothetical protein 191 192 RXA01761 441 R C hypothetical protein 1376 G S hypothetical protein 193 194 RXA01770 243 D N DNA HELICASE II (EC 3.6.1.—) 195 198 RXA01771 222 P S Membrane protease subunits, stomatin/prohibitin homologs 197 198 RXA01776 169 L F Hypothetical Membrane Spanning Protein 199 200 RXA01778 81 S F hypothetical protein 201 202 RXA01779 229 T A Hypothetical Exported Protein 203 204 RXA01789 104 L F hypothetical protein 205 206 RXA01791 60 A V hypothetical protein 207 208 RXA01809 128 R Q Hypothetical Membrane Associated Protein 209 210 RXA01825 75 G D Hypothetical Membrane Spanning Protein 211 212 RXA01842 246 P S QUINONE OXIDOREDUCTASE (EC 1.6.5.5) 213 214 RXA01899 233 S N Hypothetical Cytosolic Protein 215 216 RXA01905 103 V I Hypothetical Exported Protein 219 220 RXA01910 220 G S hypothetical protein 221 222 RXA01945 629 E K Hypothetical Exported Protein 1382 G D Hypothetical Exported Protein 223 224 RXA01966 46 A V OLIGORIBONUCLEASE (EC 3.1.—.—) 225 226 RXA01982 341 A T hypothetical protein 227 228 RXA02021 305 S F 2.3,4.5-TETRAHYDROPYRIDINE-2-CARBOXYLATE N- SUCCINYLTRANSFERASE (HOMOLOG) 229 230 RXA02032 208 G S INTEGRAL MEMBRANE PROTEIN 231 232 RXA02053 66 H Y DRGA PROTEIN 233 234 RXA02058 188 S F hypothetical protein 235 236 RXA02059 24 G R Hypothetical Membrane Spanning Protein 237 238 RXA02070 150 A T MRP PROTEIN HOMOLOG 239 240 RXA02084 105 A V hypothetical protein 241 242 RXA02091 170 E K hypothetical protein 243 244 RXA02097 28 A V TRANSCRIPTIONAL REGULATOR 427 G R TRANSCRIPTIONAL REGULATOR 245 246 RXA02104 39 T I GLYCERATE KINASE (EC 2.7.1.31) 247 248 RXA02123 678 L F MAGNESIUM-CHELATASE SUBUNIT CHLI 249 250 RXA02138 88 A V HESB PROTEIN 251 252 RXA02151 165 A T Hypothetical Membrane Spanning Protein 253 254 RXA02163 23 P S hypothetical protein 255 256 RXA02185 104 G S RPF PROTEIN PRECURSOR 257 258 RXA02186 39 S F Hypothetical Cytosolic Protein 259 260 RXA02203 36 D N hypothetical protein 261 262 RXA02206 73 G D OXIDOREDUCTASE (EC 1.1.1.—) 114 A T OXIDOREDUCTASE (EC 1.1.1.—) 314 R C OXIDOREDUCTASE (EC 1.1.1.—) 263 264 RXA02217 65 E K quinate dehydrogenase (pyrroloquinoline-quinone) (EC 1.1.99.25) 92 T I quinate dehydrogenase (pyrroloquinoline-quinone) (EC 1.1.99.25) 265 268 RXA02219 292 V M Hypothetical Cytosolic Protein 267 268 RXA02221 176 L F hypothetical protein 269 270 RXA02267 65 A T DNA (CYTOSINE-5)-METHYLTRANSFERASE (EC 2.1.1.37) 271 272 RXA02271 126 G D Hypothetical Exported Protein 273 274 RXA02280 502 A V HEAT SHOCK PROTEIN HTPG 275 276 RXA02294 57 E K hypothetical protein 277 278 RXA02295 130 G D TRANSPORTER 279 280 RXA02296 61 S N Hypothetical Exported Protein 281 282 RXA02298 447 L F Hypothetical Cytosolic Protein 283 284 RXA02302 279 G E CHLORAMPHENICOL-SENSITIVE PROTEIN RARD 285 286 RXA02303 268 G D hypothetical protein 289 290 RXA02339 38 P S Hypothetical Cytosolic Protein 291 292 RXA02360 243 V I TYPE I RESTRICTION-MODIFICATION SYSTEM DNA METHYLASE 293 294 RXA02362 525 M I TRANSCRIPTIONAL REGULATOR 295 296 RXA02367 3 I V PIRIN 297 298 RXA02387 10 A T Hypothetical Cytosolic Protein 299 300 RXA02390 195 G D THREONINE EFFLUX PROTEIN 301 302 RXA02395 533 A V TRANSPORTER 303 304 RXA02425 34 G E Heavy-Metal Cation Transporter 305 306 RXA02433 9 S N hypothetical protein 307 308 RXA02486 141 T I Hypothetical Cytosolic Protein 309 310 RXA02488 27 A T HEME TRANSPORT ASSOCIATED PROTEIN 311 312 RXA02495 81 G S ABC TRANSPORTER INTEGRAL MEMBRANE PROTEIN 313 314 RXA02570 184 R C ABC TRANSPORTER INTEGRAL MEMBRANE PROTEIN 315 316 RXA02591 377 P S PHOSPHOENOLPYRUVATE CARBOXYKINASE [GTP] (EC 4.1.1.32) 317 318 RXA02604 570 G D Hypothetical Membrane Spanning Protein 319 320 RXA02678 3 M I Hypothetical Cytosolic Protein 321 322 RXA02683 81 G S hypothetical protein 323 324 RXA02721 181 G S Hypothetical Cytosoilc Protein 325 326 RXA02725 208 A V hypothetical protein 327 328 RXA02735 27 P S 6-PHOSPHOGLUCONOLACTONASE (EC 3.1.1.31) 329 330 RXA02736 312 S F PUTATIVE OXPPCYCLE PROTEIN OPCA 331 332 RXA02744 231 M I PUTATIVE INTEGRAL MEMBRANE TRANSPORT PROTEIN 333 334 RXA02757 73 P S hypothetical protein 335 336 RXA02777 306 G D Hypothetical Cytosolic Protein 337 338 RXA02784 4 A V hypothetical protein 339 340 RXA02786 292 E K hypothetical protein 341 342 RXA02798 457 L S SULFITE REDUCTASE (NADPH) FLAVOPROTEIN ALPHA- COMPONENT (EC 1.8.1.2) 343 344 RXA02817 227 G D AEFA PROTEIN 345 346 RXA02825 362 S F HEME TRANSPORT ASSOCIATED PROTEIN 347 348 RXA02951 26 L F Hypothetical Cytosolic Protein 349 350 RXA03010 106 G S hypothetical protein 351 352 RXA03029 6 K E TRANSCRIPTIONAL REGULATOR 7 P A TRANSCRIPTIONAL REGULATOR 8 C I TRANSCRIPTIONAL REGULATOR 353 354 RXA03053 99 M I hypothetical protein 355 356 RXA03098 164 S N DNA alkylation repair enzyme 357 358 RXA03113 374 P L INTEGRAL MEMBRANE PROTEIN WITH TRKA DOMAINS 359 360 RXA03184 109 S F TRANSPORTER 361 362 RXA03316 6 D N 363 364 RXA03343 171 S F 365 366 RXA03377 55 A V 367 368 RXA03463 53 V I 369 370 RXA03506 80 P L Hypothetical Cytosolic Protein 371 372 RXA03523 167 A T 373 374 RXA03572 2 P L 375 376 RXA03577 72 A T 377 378 RXA03584 12 S F 379 380 RXA03591 902 S F 381 382 RXA03613 106 S Y 383 384 RXA03682 18 P S 385 386 RXA03700 139 E K 269 A T 387 388 RXA03803 44 P S 389 390 RXA03823 20 V I 391 392 RXA03886 23 A V 393 394 RXA03940 86 T I 395 396 RXA03947 81 A T 84 S N 397 398 RXA03955 126 T I 399 400 RXA03975 55 V I 401 402 RXA04007 145 S N 403 404 RXA04075 66 G S 173 A V 405 406 RXA04085 55 P S 407 408 RXA04092 18 A V 409 410 RXA04095 30 R K 411 412 RXA04101 40 T I 413 414 RXA04169 21 T I 415 416 RXA04172 46 P S 417 418 RXA04181 40 G D 419 420 RXA04319 40 R C 421 422 RXA04337 19 P S 423 424 RXA04352 2 G S 425 426 RXA04354 50 T I 427 428 RXA04370 102 P L 429 430 RXA04548 85 A V Hypothetical Cytosolic Protein 362 P L Hypothetical Cytosolic Protein 431 432 RXA04549 214 G R 435 436 RXA04642 112 P L 437 438 RXA06004 148 D N 439 440 RXA06040 58 S P 59 I M 93 V E 

1. An isolated nucleic acid molecule, coding for a polypeptide having the amino acid sequence in each case referred to in Table 1/column 2, wherein the nucleic acid molecule in the amino acid position indicated in Table 1/column 4 encodes a proteinogenic amino acid different from the particular amino acid indicated in Table 1/column 5 in the same row.
 2. An isolated nucleic acid molecule as claimed in claim 1, wherein the nucleic acid molecule in the amino acid position indicated in Table 1/column 4 encodes the amino acid indicated in Table 1/column 6 in the same row.
 3. A vector, which comprises at least one nucleic acid sequence as claimed in claim
 1. 4. A host cell, which is transfected with at least one vector as claimed in claim
 3. 5. A host cell as claimed in claim 4, wherein expression of said nucleic acid molecule modulates the production of a fine chemical from said cell.
 6. A method for preparing a fine chemical, which comprises culturing a cell which has been transfected with at least one vector as claimed in claim 3 so that the fine chemical is produced.
 7. A method as claimed in claim 6, wherein the fine chemical is an amino acid.
 8. A method as claimed in claim 7, wherein said amino acid is lysine. 